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FIELD OF THE INVENTION 

This application relates to the identification of biomarkers in blood, the identified 
biomarkers and compositions thereof, as well as methods related to the use of the biomarkers 
10 to monitor an individual's condition. 
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BACKGROUND 

The blood is a vital part of the human circulatory system for the human body. 
5 Numerous cell types make up the blood tissue including leukocytes consisting of 
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granulocytes (neutrophils, eosinophils and basophils), and agranuloctyes ( lymphocytes, 
and monocytes), erythrocytes, platelets, as well as possibly many other undiscovered cell 
types. 

The turnover of cells in the hematopoietic system is enormous. It was reported that 
5 over one trillion cells, including 200 billion erythrocytes and 70 billion neutrophilic 
leukocytes, turn over each day in the human body (Ogawa 1993). 

The prior art is deficient in simple non-invasive methods to diagnose, prognose, and 
monitor progression and regression of disease and to identify markers related to one or 
more conditions. Although there has been a recent use of expression array phenotyping for 

10 identification and/or classification of biomarkers of disease, the source of biomarkers has 
been limited to those which are differentially expressed in tissue, thus requiring invasive 
diagnostic procedures (e.g. see Alon U, Barkai N, Notterman DA, Gish K, Ybarra S, Mack 
D, Levine AJ: Broad patterns of gene expression revealed by clustering analysis of tumor 
and normal colon tissues probed by oligonucleotide arrays. Proc Natl Acad Sci USA 1999, 

15 96:6745-6750; Schummer M, Ng WV, Bumgarner RE, Nelson PS, Schummer B, Bednarski 
DW, Hassell L, Baldwin RL, Karlan BY, Hood L Comparative hybridization of an array of 
21500 ovarian cDNAs for the discovery of genes overexpressed in ovarian 
carcinomas.Gene 1999, 238:375-385; van't Veer LJ, Dai H, van de Vijver MJ, He YD, Hart 
AA, Mao M, Peterse HL, van der Kooy K, Marton MJ, Witteveen AT, et al.: Gene 

20 expression profiling predicts clinical outcome of breast cancer. Nature 2002, 415:530-536; 

SUMMARY OF THE INVENTION 

The present invention provides minimally invasive methods to identify biomarkers 
useful for diagnosing a condition, and biomarkers and compositions thereof, wherein the 
biomarkers of the condition are identified from a simple blood sample. Also encompassed 

25 are methods and kits utilizing said biomarkers, especially to diagnose, prognose, and 
monitor conditions, which include disease and non disease conditions.. Accordingly, 
methods of diagnosing disease, monitoring disease progression, and differentially 
diagnosing disease are provided, as well askits useful in diagnosing, monitoring disease 
progression and differentially diagnosing disease. 

30 The process described herein requires the use of a blood sample and is, therefore, 

minimally invasive as compared to conventional practices used to detect disease using 
tissue sample biomarkers. 
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Also disclosed are methods representative of means of identifying biomarkers 
which are differentially expressed as between two populations, a first population having a 
condition and a second population having a second condition, or not having a condition. 
The biomarkers thus identified can be used to diagnose an individual with a condition, or 
5 differentially diagnose an individual as having either a first or second condition. 

Other and further aspects, features, and advantages of the representations of 
the methods and products presented herein will be apparent from the following description of 
the presently preferred embodiments. These embodiments are given for the purpose of 
disclosure. 

10 BRIEF DESCRIPTION OF THE DRAWINGS 

The above-recited features, advantages and objects of the invention, as well as 
others which will become clear, are attained and can be understood in detail and more 
particular descriptions of the invention briefly summarized above may be had by reference 
to certain embodiments thereof which are illustrated in the appended drawings. These 
15 drawings form a part of the specification. It is to be noted, however, that the appended 
drawings illustrate preferred embodiments of the invention and therefore are not to be 
considered limiting in their scope. 

Figure 1 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples from individuals having both osteoarthritis and hypertension as compared with 
20 RNA expression profiles from individuals without either osteoarthritis or hypertension 
("normal"). 

Figure 2 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples from individuals who were identified as having both osteoarthritis and who were 
25 obese as described herein as compared with RNA expression profiles from individuals 
without either obesity or osteoarthritis ("normal"). 

Figure 3 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples from individuals who were identified as having both osteoarthritis and allergies as 
30 described herein as compared with RNA expression profiles from individuals without either 
allergies or osteoarthritis ("normal"). 
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Figure 4 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples from individuals having osteoarthritis and who were subject to systemic steroids as 
described herein as compared with RNA expression profiles from individuals not taking 
systemic steroids and without osteoarthritis ("normal"). 

5 

Figure 5 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples from individuals having hypertension as compared with RNA expression profiles 
from samples of both non-hypertensive and normal individuals. 

10 Figure 6 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples from individuals who were identified as obese as described herein as compared 
with RNA expression profiles from normal and non-obese individuals. 

Figure 7 shows a venn diagram illustrating a summary of the analysis comparing 
15 hypertension and OA patients vs. individuals without hypertension or OA (Table 1 A), 

hypertension and OA patients vs. OA patients (Table 1G), and the intersection between the 
two populations of genes (Table 1H). 

Figure 8 shows a venn diagram illustrating a summary of the analysis comparing obesity and 
OA patients vs. individuals without obesity or OATable IB), obesity and OA patients vs. OA 
20 patients (Table II), and the intersection between the two populations of genes (Table 1 J). 

Figure 9 shows a venn diagram illustrating a summary of the analysis comparing allergy and 
OA patients vs individuals without allergy or OA (Table 1C), allergy and OA patients vs. OA 
patients (Table IK), and the intersection between the two populations of genes (Table 1L). 

Figure 10 shows a venn diagram illustrating a summary of the analysis comparing systemic 
25 steroids and OA patients vs. individuals without OA and not exposed to systemic steroids 
(Table ID), systemic steroids and OA patients vs. OA patients (Table 1M), and the 
intersection between the two populations of genes (Table IN). 

Figure 11 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples from individuals who were identified as having OA and being on of three types of 
30 systemic steroids, including hormone replacement therapy, birth control and prednisone. 



13 



WO 2004/112589 PCT/US2004/020836 
Figure 12 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples from individuals who were identified as having type 2 diabetes as described herein 
as compared with RNA expression profiles from normal and non-type 2 diabetes 
individuals. 

5 

Figure 13 shows a diagrammatic representation of RNA expression profiles of Whole 
blood samples from individuals who were identified as having hyperlipidemia as described 
herein as compared with RNA expression profiles from normal and non-hyperlipidemia 
patients. 

10 

Figure 14 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples from individuals who were identified as having lung disease as described herein as 
compared with RNA expression profiles from normal and non lung disease individuals. 

15 Figure 15 shows a diagrammatic representation of RNA expression profiles of Whole 
blood samples from individuals who were identified as having bladder cancer as described 
herein as compared with RNA expression profiles from non bladder cancer individuals. 

Figure 16 shows a diagrammatic representation of RNA expression profiles of Whole 
20 blood samples from individuals who were identified as having advanced stage bladder 
cancer or early stage bladder cancer as described herein as compared with RNA expression 
profiles from non bladder cancer individuals. 

Figure 17 shows a diagrammatic representation of RNA expression profiles of Whole 
25 blood samples from individuals who were identified as having coronary artery disease 
(CAD) as described herein as compared with RNA expression profiles from non-coronary 
artery disease individuals. 

Figure 1 8 shows a diagrammatic representation of RNA expression profiles of Whole 
30 blood samples from individuals who were identified as having rheumatoid arthritis as 
described herein as compared with RNA expression profiles from non-rheumatoid arthritis 
individuals. 
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Figure 19 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples from individuals who were identified as having depression as described herein as 
compared with RNA expression profiles from non-depression individuals. 

5 Figure 20 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples from individuals who were identified as having various stages of osteoarthritis as 
described herein as compared with RNA expression profiles from individuals without 
osteoarthritis. 

Figure 21 shows a diagrammatic representation of RNA expression profiles of Whole blood 
10 samples from individuals who were identified as having liver cancer as described herein as 
compared with RNA expression profiles from individuals not having liver cancer. 

Figure 22 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples from individuals who were identified as having schizophrenia as described herein as 
compared with RNA expression profiles from individuals not having schizophrenia. 

15 Figure 23 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples from individuals who were identified as having symptomatic or asymptomatic 
Chagas' disease as described herein as compared with RNA expression profiles from 
individuals without Chagaas Disease. 

Figure 24 shows a diagrammatic representation of RNA expression profiles of Whole blood 
20 samples from individuals who were identified as having asthma and OA as compared with 
individuals having OA but not asthma . 

Figure 25 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples from individuals who were identified as having manic depression syndrome as 
compared with those individuals who have schizophrenia. 

25 Figure 26 shows a representation of the presentation of various stages of OA in patients of 
with respect to the age group of the patients. 

Figure 27 shows RT-PCR of overexpressed genes in CAD peripheral blood cells identified 
using microarray experiments, including PBP, PF4 and F13A. 

Figure 28 shows the the "Blood Chip", a cDNA microarray slide with 10,368 PGR products 
30 derived from peripheral blood cell cDNA libraries. Colors represent hybridization to probes 
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labeled mth Cy3 (green) or Cy5 (red) . Yellow spots indicate common hybri dization between 
both probes. In slide A, normal bl ood cell RNA samples were labeled with Cy3 and CAD 
blood cell RNA samples were labeled with Cy5. In slide B, Cy3 and Cy5 were switched to 
label the RNA samples. (Cluster analysis revealed distinct gene expression profiles for 
5 normal and CAD samples.) 

DETAILED DESCRIPTION: 

Disclosed herein are methods that would be understood by a person skilled in the art 
as representing means of identifying biomarkers which correlate to one or more nucleic 
acid transcripts which are differentially expressed in blood, according to a condition of 
interest, wherein the condition of interest includes a disease, a stage of disease, as well as 
other non-disease conditions.Also disclosed herein is a composition comprising the 
biomarker(s) identified as such, the biomarker(s )themselves, as well as methods of using 
the biomarker(s). Such methods include using the biomarkers to diagnose an individual 
as having a condition of interest or a certain stage of a condition of interest, andto 
differentiate between two or more conditions. Products which are representative of kits 
useful in diagnosing an individual as having a a condition of interest are also disclosed. 

In one embodiment of the invention, a blood sample is collected from one or more 
individuals having a condition of interest, and RNA is isolated from said blood sample. In 
a preferred embodiment the blood sample is whole blood without prior fractionation. In 
20 another preferred embodiment, the blood sample is peripheral blood leukocytes. In another 
preferred embodiment, the blood sample is peripheral blood mononuclear cells (PBMCs). 

Biomarkers are identified by measuring the level of one or more species of RNA 
transcripts or a synthetic nucleic acid copy (cDNA, cRNA etc.) thereof, from one or more 
individuals who have a condition of interest or who do not have said condition of interest 

25 and/or who are healthy and normal. . In one embodiment, the level of one or more species 
of RNA transcripts is determined by quantitating the level of an RNA species of the 
invention. In one embodiment for example, mass spectrometry may be used to quantify 
the level of one or more species of RNA transcripts (Koster et al., 1996; Fu et al., 1998). In 
a preferred embodiment, the level of one or more species of RNA transcripts is determined 

30 using microarray analysis. In another preferred embodiment, the level of one or more 

species of RNA transcripts is measured using quantitative RT-PCR. In accordance with the 
present invention, there may be employed other conventional molecular biology, 
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microbiology, and recombinant DNA techniques within the skill of the art in order to 
quantitatively or semi-quantitatively measure one or more species of RNA transcripts. 
Such techniques are explained fully in the literature. See,, e.g., Sambrook, Fritsch & 
Maniatis, "Molecular Cloning: A Laboratory Manual (1982); "DNA Cloning: A Practical 
5 Approach," Volumes I and II (D.N. Glover ed. 1985); "Oligonucleotide Synthesis" (M.J. 
Gait ed. 1984); "Nucleic Acid Hybridization" [B.D. Hames & S.J. Higgins eds. (1985)]; 
"Transcription and Translation" [B.D. Hames & S.J. Higgins eds. (1984)]; "Animal Cell 
Culture" [R.I. Freshney, ed. (1986)]; "Immobilized Cells And Enzymes" [IRL Press, 
(1986)]; B. Perbal, "A Practical Guide To Molecular Cloning" (1984). In a preferred 
10 embodiment quantitative RT-PCR may be used for the purpose of measuring/quantitating 
transcripts in blood. 

In a preferred embodiment, expression levels of one or more species of RNA 
transcripts from a population of samples having a condition of interest are compared those 
levels from a population of samples not having the condition of interestso as to identify 

15 biomarkers which are able to differentiate between the two populations. In another 

preferred embodiment, expression levels of one or more species of RNA transcripts from a 
population of samples having a first condition of interest are compared with a those from a 
population of samples having a second condition of interest so as to identify biomarkers 
which can differentiate between said conditions. In another preferred embodiment, when 

20 comparing two populations of individuals to identify biomarkers of a condition of interest, 
the populations are chosen such that the populations share at least one phenotype which is 
not the condition of. More preferably the populations have two or more, three or more, four 
or more etc. phenotypes in common. By phenotype is meant any trait which is not the 
condition of interest, for example, in a preferred embodiment individuals within the 

25 populations being used to identify biomarkers of a condition are of a similar age, sex, body 
mass index (BMI). 

The identified biomarkers can be used to determine whether an individual has a 
condition of interest. As would be understood to a person skilled in the art, one can utilize 
the biomarkers identified, or combinations of the biomarkers identified, to characterize an 
30 unknown sample in accordance with "class prediction" methods as would be understood by a 
person skilled in the art. 
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The following terms shall have the definitions set out below: 

A "cDNA" is defined as copy-DNA or complementary-DNA, and is a product of a 
reverse transcription reaction from an mRNA transcript. "RT-PCR" refers to reverse 
transcription polymerase chain reaction and results in production of cDNAs that are 
complementary to the mRNA template(s). RT-PCR includes "QRT-PCR", quantitative real 
time reverse transcription polymerase chain reaction which uses a labeling means to 
quantitate the level of mRNA transcription and can either be done using the one step or two 
step protocols for the making of cDNA and the amplification step. The labeling means can 
include SYBR® green intercolating dye; TaqMan® probes and Molecular Beacons® as 
well as others as would be understood by a person skilled in the art. 

The term "oligonucleotide" is defined as a molecule comprised of two or more 
deoxyribonucleotides and/ or ribonucleotides, preferably more than three. Its exact size 
will depend upon many factors which, in turn, depend upon the ultimate function and use of 
the oligonucleotide. The upper limit may be 15, 20, 25, 30, 40 or 50 nucleotides in length. 
The term "primer 11 as used herein refers to an oligonucleotide, whether occurring naturally 
as in a purified restriction digest or produced synthetically, which is capable of acting as a 
point of initiation of synthesis when placed under conditions in which synthesis of a primer 
extension product, which is complementary to a nucleic acid strand, is induced, i.e., in the 
presence of nucleotides and an inducing agent such as a DNA polymerase and at a suitable 
temperature and pH. The primer may be either single-stranded or double-stranded and 
must be sufficiently long to prime the synthesis of the desired extension product in the 
presence of the inducing agent. The exact length of the primer will depend upon many 
factors, including temperature, source of primer and the method used. For example, for 
diagnostic applications, depending on the complexity of the target sequence, the 
oligonucleotide primer typically contains 15-25 or more nucleotides, although it may 
contain fewer nucleotides. The factors involved in determining the appropriate length of 
primer are readily known to one of ordinary skill in the art. 

As used herein, random sequence primers refer to a composition of primers of 
random sequence, i.e. not directed towards a specific sequence. These sequences possess 
sufficient nucleotides complementary to a polynucleotide tohybridize with said 
polynucleotide and the primer sequence need not reflect the exact sequence of the template. 
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"Restriction fragment length polymorphism" refers to variations in DNA sequence 
detected by variations in the length of DNA fragments generated by restriction 
endonuclease digestion. 

A standard Northern blot assay can be used to ascertain the relative amounts of 
mRNA in a cell or tissue obtained from plant or other tissue, in accordance with 
conventional Northern hybridization techniques known to those persons of ordinary skill in 
the art. The Northern blot uses a hybridization probe, e.g. radiolabeled cDNA, either 
containing the full-length, single stranded DNA or a fragment of that DNA sequence at 
least 20 (preferably at least 30, more preferably at least 50, and most preferably at least 100 
consecutive nucleotides in length). The DNA hybridization probe can be labeled by any of 
the many different methods known to those skilled in this art. The labels most commonly 
employed for these studies are radioactive elements, enzymes, chemicals which fluoresce 
when exposed to ultraviolet light, and others. A number of fluorescent materials are known 
and can be utilized as labels. These include, for example, fluorescein, rhodamine, 
auramine, Texas Red, AMCA blue and Lucifer Yellow. A particular detecting material is 
anti-rabbit antibody prepared in goats and conjugated with fluorescein through an 
isothiocyanate. Proteins can also be labeled with a radioactive element or with an enzyme. 
The radioactive label can be detected by any of the currently available counting procedures. 
The preferred isotope may be selected from 3 H, 14 C, 32 P, 35 S, 36 C1, 51 Cr, 57 Co, 58 Co, 59 Fe, 
90 Y, 125 I, 131 I, and 186 Re. Enzyme labels are likewise useful, and can be detected by any of 
the presently utilized colorimetric, spectrophotometries fluorospectrophotometric, 
amperometric or gasometric techniques. The enzyme is conjugated to the selected particle 
by reaction with bridging molecules such as carbodiimides, diisocyanates, glutaraldehyde 
and the like. Many enzymes which can be used in these procedures are known and can be 
utilized. The preferred are peroxidase, p-glucuronidase, p~D-glucosidase, P-D- 
galactosidase, urease, glucose oxidase plus peroxidase and alkaline phosphatase. U.S. 
Patent Nos. 3,654,090, 3,850,752, and 4,016,043 are referred to by way of example for their 
disclosure of alternate labeling material and methods. 

As defined herein, a "nucleic acid array" and "microarray" refers to a plurality of 
unique nucleic acids (or "nucleic acid members") attached to a support where each of the 
nucleic acid members is attached to a support in a unique pre-selected region. In one 
embodiment, the nucleic acid probe attached to the surface of the support is DNA. In a 
preferred embodiment, the nucleic acid probe attached to the surface of the support is either 
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cDNA or oligonucleotides. In another preferred embodiment, the nucleic acid probe attached 
to the surface of the support is cDNA synthesized by polymerase chain reaction (PCR). The 
term "nucleic acid", as used herein, is interchangeable with the term "polynucleotide". In 
another preferred embodiment, a "nucleic acid array" refers to a plurality of unique nucleic 
5 acids attached to nitrocellulose or other membranes used in Southern and/or Northern blotting 
techniques. 

As used herein, " an individual" refers to a human subject as well as a non-human 
subject such as a mammal, an invertebrate, a vertebrate, a rat, a horse, a dog, a cat, a cow, a 

10 chicken, a bird, a mouse, a rodent, a primate, a fish, a frog and a deer. The examples herein 
are not meant to limit the methodology of the present invention to a human subject only, as 
the instant methodology is useful in the fields of veterinary medicine, animal sciences and 
such. The term "individual" refers to a human subject and a non-human subject who are 
condition free and also includes a human and a non-human subject diagnosed with one or 

15 more conditions, as defined herein. "Co-morbid individuals" or "comorbidity" or 

"individuals considered as co-morbid" are individuals who have more than one condition as 
defined herein. For example a patient diagnosed with both osteoarthritis and hypertension 
is considered to present with comorbidities. 

As used herein, "detecting" refers to determining the presence of a one or more 
20 species of RNA transcripts, for example cDNA, RNA or EST, by any method known to 
those of skill in the art or taught in numerous texts and laboratory manuals (see for 
example, Ausubel et al. Short Protocols in Molecular Biology (1995) 3rd Ed. John Wiley & 
Sons, Inc.). For example, methods of detection include but are not limited to, RNA 
fingerprinting, Northern blotting, polymerase chain reaction, ligase chain reaction, Qbeta 
25 replicase, isothermal amplification method, strand displacement amplification, transcription 
based amplification systems, nuclease protection (SI nuclease or RNAse protection assays) 
as well as methods disclosed in WO88/10315, WO89/06700, PCT/US87/00880, 
PCT/US89/01025. 

As used herein, a "condition" of the invention refers to a mode or state of 
30 being including a physical, emotional, psychological or pathological state. A condition can 
be as a result of both "genetic" and/or "environmental" factors. By "genetic factors" is meant 
genetically inherited factors or characteristics inherent as a result of the genetic make up of 
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the individual. By "environmental factors" is meant those factors which are not genetically 
inherited, but which are the result of exposure to internal or external influences. In one 
embodiment of the invention, a condition is a disease as defined herein. In another 
embodiment of the invention, a condition is a stage of a disease as defined herein. In yet 
5 another embodiment of the invention, a condition is a mode or state of being which is not a 
disease. For example in one embodiment, a condition which is not a disease is a condition 
resulting from the progression of time. A condition resulting from progression of time can 
include, but is not limited to: memory loss, loss of skin elasticity, loss of muscle tone, and 
loss of sexual desire. In a further embodiment of the invention a condition which is not a 

10 disease is a treatment. A treatment can include, but is not limited to disease modifying 

treatments as well as treatments useful in mitigating the symptoms of disease. For example 
treatments can include drugs specific for a disease of the invention. In a preferred 
embodiment, treatments can include drugs specific for Alzheimer's, Cardiovascular disease, 
Manic Depression Syndrome, Schizophrenia, Diabetes and Osteoarthritis. For example, 

15 treatments can include but are not limited to VIOXX®, Celebrex®, NSAIDS, Cortisone, 
Visco supplement, Lipitor®, Adriamycin®, Cytoxan®, Herceptin®, Nolvadex® Avastin®, 
Erbitux®, Fluorouracil®, Largactil®, Sparine®, Vesprin®, Stelazine®, Fentazine®, 
Prolixin®, Compazine®, Tindal®, Modecate®, Moditen®, Mellarin, Serentil, Norvane, ®, 
Fluanxol®, Clopixol®, Taractan®, Depixol®, Clopixol®, Haldol®, Haldol® Decanoate, 

20 Orap®, Inapsine®, Imap®, Semap®, Loxitane®, Daxol®, lithium, anticonvulsants (for ex. 
carbamazepine) and antidepressants and Moban®. More generally and addition, a treatment 
can include any treatment or drug described in the Compendium of Pharmaceuticals and 
Specialties, Canadian Pharmaceutical Association; 26 th edition, June, 1991; Krogh, 
Compendium of Pharmaceuticals and Specialties, Canadian Pharmaceutical Association; 27 th 

25 edition, April, 1992. In a further embodiment, a condition of the invention which is not a 
disease is a response to environmental factors including but not limited to pollution, 
environmental toxins, lead poisoning, mercury posining, exposure to genetically modified 
organisms, exposure to radioactivity, pesticides, insecticides, and cigarette smoke, alcohol, or 
exercise. In a further embodiment, a condition is a state of health. 

*° As used herein, a disease of the invention includes, but is not limited to, blood 

disorder, blood lipid disease, autoimmune disease, arthritis (including osteoarthritis, 
rheumatoid arthritis, lupus, allergies, juvenile rheumatoid arthritis and the like), bone or joint 
disorder, a cardiovascular disorder (including heart failure, congenital heart disease; 
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rheumatic fever, valvular heart disease; corpulmonale, cardiomyopathy, myocarditis, 
pericardial disease; vascular diseases such as atherosclerosis, acute myocardial infarction, 
ischemic heart disease and the like), obesity, respiratory disease (including asthma, 
pneumonitis, pneumonia, pulmonary infections, lung disease, bronchiectasis, tuberculosis, 
5 cystic fibrosis, interstitial lung disease, chronic bronchitis emphysema, pulmonary 

hypertension, pulmonary thromboembolism, acute respiratory distress syndrome and the 
like), hyperlipidemias, endocrine disorder, immune disorder, infectious disease, muscle 
wasting and whole body wasting disorder, neurological disorders (including migraines, 
seizures, epilepsy, cerebrovascular diseases, alzheimers, dementia, Parkinson's, ataxic 

10 disorders, motor neuron diseases, cranial nerve disorders, spinal cord disorders, meningitis 
and the like) including neurodegenerative and/or neuropsychiatric diseases and mood 
disorders (including schizophrenia, anxiety, bipolar disorder; manic depression and the like, 
skin disorder, kidney disease, scleroderma, stroke, hereditary hemorrhage telangiectasia, 
diabetes, disorders associated with diabetes (e.g., PVD), hypertension, Gaucher ! s disease, 

15 cystic fibrosis, sickle cell anemia, liver disease, pancreatic disease, eye, ear, nose and/or 

throat disease, diseases affecting the reproductive organs, gastrointestinal diseases (including 
diseases of the colon, diseases of the spleen, appendix, gall bladder, and others) and the like. 
For further discussion of human diseases, see Mendelian Inheritance in Man: A Catalog of 
Human Genes and Genetic Disorders by Victor A. McKusick (12th Edition (3 volume set) 

20 June 1998, Johns Hopkins University Press, ISBN: 0801857422) and Harrison's Principles of 
Internal Medicine by Braunwald, Fauci, Kasper, Hauser, Longo, & Jameson (15th Edition 
2001), the entirety of which is incorporated herein. 

In another embodiment of the invention, a disease refers to an immune disorder, 
such as those associated with overexpression of a gene or expression of a mutant gene (e.g., 

25 autoimmune diseases, such as diabetes mellitus, arthritis (including rheumatoid arthritis, 
juvenile rheumatoid arthritis, osteoarthritis, psoriatic arthritis), multiple sclerosis, 
encephalomyelitis, myasthenia gravis, systemic lupus erythematosis, automimmune 
thyroiditis, dermatitis (including atopic dermatitis and eczematous dermatitis), psoriasis, 
Sjogren's Syndrome, Crohn's disease, ulcerative colitis, aphthous ulcer, iritis, 

30 conjunctivitis, keratoconjunctivitis, ulcerative colitis, asthma, allergic asthma, cutaneous 
lupus erythematosus, scleroderma, vaginitis, proctitis, drug eruptions, leprosy reversal 
reactions, erythema nodosum leprosum, autoimmune uveitis, allergic encephalomyelitis, 
acute necrotizing hemorrhagic encephalopathy, idiopathic bilateral progressive 
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sensorineural hearing, loss, aplastic anemia, pure red cell anemia, idiopathic 
thrombocytopenia, polychondritis, Wegener's granulomatosis, chronic active hepatitis, 
Stevens- Johnson syndrome, idiopathic sprue, lichen planus, Graves' disease, sarcoidosis, 
primary biliary cirrhosis, uveitis posterior, and interstitial lung fibrosis), graft- versus-host 
disease, cases of transplantation, and allergy. 

In another embodiment, a disease of the invention is a cellular proliferative and/or 
differentiative disorder that includes, but is not limited to, cancer e.g., carcinoma, sarcoma 
or other metastatic disorders and the like. As used herein, the term "cancer" refers to cells 
having the capacity for autonomous growth, i.e., an abnormal state of condition 
characterized by rapidly proliferating cell growth. "Cancer" is meant to include all types of 
cancerous growths or oncogenic processes, metastatic tissues or malignantly transformed 
cells, tissues, or organs, irrespective of histopathologic type or stage of invasiveness. 
Examples of cancers include but are not limited to solid tumors and leukemias, including: 
apudoma, choristoma, branchioma, malignant carcinoid syndrome, carcinoid heart disease, 
carcinoma (e.g., Walker, basal cell, basosquamous, Brown-Pearce, ductal, Ehrlich tumour, 
in situ, Krebs 2, Merkel cell, mucinous, non-small cell lung, oat cell, papillary, scirrhous, 
bronchiolar, bronchogenic, squamous cell, and transitional cell), histiocytic disorders, 
leukaemia (e.g., B cell, mixed cell, null cell, T cell, T-cell chronic, HTLV-H-associated, 
lymphocytic acute, lymphocytic chronic, mast cell, and myeloid), histiocytosis malignant, 
Hodgkin disease, immunoproliferative small, non-Hodgkin lymphoma, plasmacytoma, 
reticuloendotheliosis, melanoma, chondroblastoma, chondroma, chondrosarcoma, fibroma, 
fibrosarcoma, giant cell tumors, histiocytoma, lipoma, liposarcoma, mesothelioma, 
myxoma, myxosarcoma, osteoma, osteosarcoma, Ewing sarcoma, synovioma, 
adenofibroma, adenolymphoma, carcinosarcoma, chordoma, craniopharyngioma, 
dysgerminoma, hamartoma, mesenchymoma, mesonephroma, myosarcoma, 
ameloblastoma, cementoma, odontoma, teratoma, thymoma, trophoblastic tumour, adeno- 
carcinoma, adenoma, cholangioma, cholesteatoma, cylindroma, cystadenocarcinoma, 
cystadenoma, granulosa cell tumour, gynandroblastoma, hepatoma, hidradenoma, islet cell 
tumour, Leydig cell tumour, papilloma, Sertoli cell tumour, theca cell tumour, leiomyoma, 
leiomyosarcoma, myoblastoma, mymoma, myosarcoma, rhabdomyoma, 
rhabdomyosarcoma, ependymoma, ganglioneuroma, glioma, medulloblastoma, 
meningioma, neurilemmoma, neuroblastoma, neuroepithelioma, neurofibroma, neuroma, 
paraganglioma, paraganglioma nonchromaffin, angiokeratoma, angiolymphoid hyperplasia 
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with eosinophilia, angioma sclerosing, angiomatosis, glomangioma, 
hemangioendothelioma, hemangioma, hemangiopericytoma, hemangiosarcoma, 
lymphangioma, lymphangiomyoma, lymphangiosarcoma, pinealoma, carcinosarcoma, 
chondrosarcoma, cystosarcoma, phyllodes, fibrosarcoma, hemangiosarcoma, 

5 leimyosarcoma, leukosarcoma, liposarcoma, lymphangiosarcoma, myosarcoma, 

myxosarcoma, ovarian carcinoma, rhabdomyosarcoma, sarcoma (e.g., Ewing, experimental, 
Kaposi, and mast cell), neoplasms (e.g., bone, breast, digestive system, colorectal, liver, 
pancreatic, pituitary, testicular, orbital, head and neck, central nervous system, acoustic, 
pelvic respiratory tract, and urogenital), neurofibromatosis, and cervical dysplasia, and 

10 other conditions in which cells have become immortalized or transformed. 

"Cardiovascular Disease" is defined herein as any disease or disorder of the cardiovascular 
system and includes arteriosclerosis, heart valve disease, arrhythmia, and ., orthostatic 
hypotension, shock, endocarditis, diseases of the aorta and its branches, disorders of the 
peripheral vascular system, and congenital heart disease.as a disease affecting the heart or 
15 blood vessels. Cardiovascular diseases include coronary artery disease, hearart failure, and 
hypertension. 

As used herein "Neurological Disease" is defined as a disorder of the nervous 
system, and include disorders that involve the central nervous system (brain, brainstem and 
cerebellum), the peripheral nervous system (including cranial nerves), and the autonomic 
20 nervous system (parts of which are located in both central and peripheral nervous system). 
In particular neurological disease includes alzheimers', schizophrenia, and manic 
depression syndrome. 

As used herein, a "population" or a "population of individuals" of the invention 
25 refers to a population of two or more individuals wherein the individuals have at least a 
single condition of interest in common. A population of the invention can also have two or 
more conditions in common. A population of the invention can also be comprised of two or 
more individuals who do not have a condition of interest. 

30 As used herein, "diagnosis" refers to the ability to demonstrate an increased 

likelihood that an individual has a specific condition or conditions. Diagnosis also refers to 
the ability to demonstrate an increased likelihood that an individual does not have a specific 
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condition. More particularly "diagnosis" refers to the ability to demonstrate an increased 
likelihood that an individual has one condition as compared to a second condition. More 
particularly "diagnosis" refers to a process whereby there is an increased likelihood that an 
individual is properly characterized as having a condition ("true positive") or is properly 
5 characterized as not having a condition ("true negative") while minimizing the likelihood 
that the individual is improperly characterized with said condition ("false positive") or 
improperly characterized as not being afflicted with said condition ("false negative"). 

As used herein, "treatment" refers to the administration of a drug, pharmaceutical, 
10 nutraceutical, or other form of therapeutic regime which has the potential to reverse or 

ameliorate the pathology of a disease condition, produce a change in a condition as measured 
by either the lessening of the number or severity of symptoms or effects of the condition, as 
determined by a physician. In a preferred embodiment a treatment of the invention is a 
treatment for a disease. In another preferred embodiment, a treatment of the invention is a 
15 treatment of a disease selected from the group of: liver cancer, urinary bladder cancer, 
gallbladder cancer, brain cancer, prostate cancer, ovarian cancer, cervical cancer, kidney 
cancer, gastric cancer, colon cancer, lung cancer, breast cancer, nasopharyngeal cancer, 
pancreatic cancer, osteoarthritis, depression, hypertension, heart failure, obesity, rheumatoid 
arthritis, hyperlipidemia, lung disease, Chagas' disease, allergies, schizophrenia and asthma, 
20 manic depression syndrome, ankylosing spondylitis, guillain barre syndrome, fibromyalgia, 
multiple sclerosis, muscular dystrophy, septic joint arthroplasty, hepatitis, Crohn's disease or 
colitis, or malignant hyperthermia susceptibility, psoriasis, thyroid disorder, irritable bowel 
syndrome, osteoporosis, migraines, eczema, or a heart murmer. 

25 As used herein, a "response to treatment" indicates a physiological change as a 

result of the "application of treatment" to a condition where "treatment" includes 
pharmaceuticals, neutraceuticals, and other drugs or treatment regimes. The relative 
success of a response to treatment is determined by a physician. As used herein, by the 
term "treatment regime" is meant a course of treatment ranging from a single application or 

30 dose to multiple applications of one or more doses over time. 

As used herein, a "biomarker" is a molecule which corresponds to a species of a 
nucleic acid transcript that has a quantitatively differential concentration or level in 
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bloodwith respect to an aspect of the condition of interest. As such, a biomarker includes a 
synthetic nucleic acid copolymer thereof, including cRNA, cDNA, and the like. A species of a 
nucleic acid transcript includes any nucleic acid transcript which is transcribed from any part 
of the individual's chromosomal and extrachromo somal genome including for example the 
5 mitochondrial genome. Preferably a species of a nucleic acid transcript is an RNA transcript, 
preferably the RNA transcript includes a primary transcript, a spliced transcript, an 
alternatively spliced transcript, or an mRNA. An aspect of the condition of interest includes 
the presence or absence of the condition in an individual or group of individuals for which the 
biomarker is identified or assayed, and also includes the stage of progression or regression of 

10 a condition including a disease condition. For example, a biomarker is a molecule which 
corresponds to a species of an RNA transcript which is present at an increased level s or a 
decreased level of in the blood of an individual or a population of individuals having at 
least one condition of interest, when compared to the level of said transcript in the blood from 
a population of individuals not having said condition of interest.. Molecules encompassed by 

15 the term biomarker includeESTs, cDNAs, primers, etc. A biomarker can be used either 
solely or in conjunction with one or more other identified biomarkers, so as to allow 
diagnosis of a condition of interest as defined herein. 

As used herein, the term " concentration or level" of a species of an RNA 
transcript refers to the measurable quantity of a given biomarker. The " " concentration or 

20 level" " of a species of an RNA transcript can be determined by measuring the level of RNA 
using semi-quantitative methods such as microarray hybridization or more quantitative 
measurements such as quantitative real-time RT-PCR which corresponds in direct proportion 
with the extent to which the gene is expressed. The "concentration or level " of a species of 
an RNA transcript is determined by methods well known in the art. As used herein the term 

25 "differential expression" refers to a difference in the level of expression of a species of an 
RNA nucleic acid transcript, as measured by the amount or level of RNA or can also include 
a measurement of the protein encoded by the gene corresponding to the nucleic acid 
transcript, in a sample or population of samples as compared with the amount or level of 
RNA or protein expression of the same nucleic acid transcript in a second sample or second 

30 population of samples. The term "differentially expressed" or "changes in the level of 

expression" refers to an increase or decrease in the measurable expression level of a given 
biomarker in a sample as compared with the measurable expression level of a given 
biomarkerin a second sample. The term "differentially expressed" or "changes in the level of 
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expression" can also refer to an increase or decrease in the measurable expression level of a 
given biomarkerin a population of samples as compared with the measurable expression 
level of a biomarkerin a second population of samples. As used herein, "differentially 
expressed" when referring to a single sample can be measured using the ratio of the level of 
5 expression of a given nbiomarker in said sample as compared with the mean expression level 
of the given biomarker of a control population wherein the ratio is not equal to 1.0. 
Differentially expressed can also be used to include comparing a first population of samples 
as compared with a second population of samples or a single sample to a population of 
samples using either a ratio of the level of expression or using p-value. When using p-value, 

10 a nucleic acid transcript is identified as being differentially expressed as between a first and 
second population when the p-value is less than 0.1. More preferably the p-value is less than 
0.05. Even more preferably the p-value is less than 0.01. More preferably still the p-value is 
less than 0.005. Most preferably the p-value is less than 0.001 . When determining whether a 
nucleic acid transcript is differentially expressed on the basis of the ratio of the level of 

15 expression, a nucleic acid transcript is differentially expressed if the ratio of the level of 

expression of a nucleic acid transcript in a first sample as compared with a second sample is 
greater than or less than 1.0. For example, a ratio of greater than 1.2, 1.5, 1.7, 2, 3, 4, 10, 
20or a ratio less than 1, for example 0.8, 0.6, 0.4, 0.2, 0.1. 0.05. In another embodiment of 
the invention a nucleic acid transcript is differentially expressed if the ratio of the mean of the 

20 level of expression of a first population as compared with the mean level of expression of the 
second population is greater than or less than 1.0 For example, a ratio of greater than 1.2, 1.5, 
1.7, 2, 3, 4, 10, 20 or a ratio less than 1, for example 0.8, 0.6, 0.4, 0.2, 0.1. 0.05In another 
embodiment of the invention a nucleic acid transcript is differentially expressed if the ratio of 
its level of expression in a first sample as compared with the mean of the second population 

25 is greater than or less than 1 .0 and includes for example, a ratio of greater than 1 .2, 1 .5, 1 .7, 
2, 3, 4, 10, 20, or a ratio less than 1, for example 0.8, 0.6, 0.4, 0.2, 0.1. 0.05. "Differentially 
increased expression" refers to 1 . 1 fold, 1 .2 fold, 1 .4 fold, 1 .6 fold, 1 .8 fold, or more, relative 
to a standard, such as the mean of the expresion level of the second population. 
"Differentially decreased expression" refers to less than 1.0 fold, 0.8 fold, 0.6 fold, 0.4 fold, 

30 0.2 fold, 0.1 fold or less, relative to a standard, such as the mean of the expresion level of the 
second population.. 

A nucleic acid transcript is also said to be differentially expressed in two samples if 
one of the two samples contains no detectable expression of the nucleic acid transcript. 
Absolute quantification of the level of expression of a nucleic acid transcriptcan be 
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accomplished by including known concentration(s) of one or more control nucleic acid 
transcript, generating a standard curve based on the amount of the control s nucleic acid 
transcriptand extrapolating the expression level of the "unknown" nucleic acid transcript, for 
example, from the real-time RT PGR hybridization intensities of the unknown with respect to 
5 the standard curve. 

i 

By a micleic acid transcript that is "expressed in blood" is meant a nucleic acid 
transcript that is expressed in one or more cells of blood, wherein the cells of blood include 
monocytes, leukocytes, lymphocytes, erythrocytes, all other cells derived directly from 
10 haemopoietic or mesenchymal stem cells, or cells derived directly from a cell which typically 
makes up the blood. 



The term "biomarker" further includes any molecule that correlates to, or is 
reflective of the transcript produced from any region of nucleic acid that can be transcribed, 

15 as the invention contemplates detection of RNA or equivalents thereof, i.e., cDNA or EST. 
A biomarker of the invention includes but is not limited to regions which are translated 
into proteins which are specific for or involved in a particular biological process, such as 
apoptosis, differentiation, stress response, aging, proliferation, etc.; cellular mechanism 
genes, e.g. cell-cycle, signal transduction, metabolism of toxic compounds, and the like; 

20 disease associated genes, e.g. genes involved in cancer, schizophrenia, diabetes, high blood 
pressure, atherosclerosis, viral-host interaction, infection and the like. A biomarker of the 
the invention includes, but is not limited to transcripts transcribed from immune response 
genes. A gene of the invention is a biomarker of a condition and can be a biomarker of 
disease, or a biomarker of a non disease condition as defined herein. 

25 For example, a biomolecule can be reflective of or correlate to the transcript from any 

gene , includingan oncogene (Hanahan, D. and R.A. Weinberg, Cell (2000) 100:57; and 
Yokota, J., Carcinogenesis (2000) 21(3):497-503) whose expression within a cell induces that 
cell to become converted from a normal cell into a tumor cell. Examples of genes which 
produce transcripts)' to which a biomarker is correlated to or reflective of, include, but are 

30 not limited to, include cytokine genes (Rubinstein, M., et al., Cytokine Growth Factor Rev. 
(1998) 9(2):175-81); idiotype (Id) protein genes (Benezra, R., et al., Oncogene (2001) 
20(58):8334-41; Norton, J.D., J. Cell Sci. (2000) 113(22):3897-905); prion genes (Prusiner, 

28 



WO 2004/112589 PCT/US2004/020836 
S.B., et aL, Cell (1998) 93(3):337-48; Safar, J., and S.B. Prusiner, Prog. Brain Res. (1998) 
1 17:421-34); genes that express molecules that induce angiogenesis (Gould, V.E. and B.M. 
Wagner, Hum. Pathol. (2002) 33(ll):1061-3); genes encoding adhesion molecules (Chothia, 
C. and E.lf . Jones, Annu. Rev. Biochem. (1997) 66:823-62; Parise, L.V., et aL, Semin. 
5 Cancer Biol. (2000) 10(6):407-14); genes encoding cell surface receptors (Deller, M.C., and 
Y.E. Jones, Curr. Opin. Struct. Biol. (2000) 10(2):213-9); genes of proteins that are involved 
in metastasizing and/or invasive processes (Boyd, D., Cancer Metastasis Rev. (1996) 
15(l):77-89; Yokota, J., Carcinogenesis (2000) 21(3):497-503); genes of proteases as well as 
of molecules that regulate apoptosis and the cell cycle (Matrisian, L.M., Curr. Biol. (1999) 
0 9(20):R776-8; Krepela, E., Neoplasma (2001) 48(5):332-49; Basbaum and Werb, Curr. Opin. 
Cell Biol. (1996) 8:731-738; Birkedal-Hansen, et aL, Crit. Rev. Oral Biol. Med. (1993) 
4:197-250; Mignatti and Rifkin, Physiol. Rev. (1993) 73:161-195; Stetler-Stevenson, et aL, 
Annu. Rev. Cell Biol. (1993) 9:541-573; Brinkerhoff, E., and L.M. Matrisan, Nature Reviews 
(2002) 3:207-214; Strasser, A., et aL, Annu. Rev. Biochem. (2000) 69:217-45; Chao, D.T. 

15 and S.J. Korsmeyer, Annu. Rev. Immunol. (1998) 16:395-419; Mullauer, L., et al., Mutat. 
Res. (2001) 488(3):21 1-31; Fotedar, R., et aL, Prog. Cell Cycle Res. (1996) 2:147-63; Reed, 
J.C., Am. J. Pathol. (2000) 157(5):1415-30; D'Ari, R., Bioassays (2001) 23(7):563-5); or 
multi-drug resistance genes, suach as MDR1 gene (Childs, S., and V. Ling, Imp. Adv. Oncol. 
(1994) 21-36). Li another embodiment, a gene which produces transcript(s) to which a 

20 biomarker is correlated to or reflective of, include, but are not limited to, an immune 
response gene or a non-immune response gene. By an immune response gene is meant a 
primary defense response gene located outside the major histocompatibility region (MHC) 
that is initially triggered in response to a foreign antigen to regulate immune responsiveness. 
All other genes expressed in blood are considered to be non-immune response genes. For 

25 example, an immune response gene would be understood by a person skilled in the art to 
include: cytokines including interleukins and interferons such as TNF-alpha, IL-10, IL-12, 
IL-2, IL-4, IL-10, IL-12, IL-13, TGF-Beta, IFN-gamma; immunoglobulins, complement and 
the like (see for example Bellardelli, F. Role of interferons and other cytokines in the 
regulation of the immune response APMIS. 1995 Mar; 103(3): 161-79; ); . 

30 

Construction of a Nucleic Acid Array 

A nucleic acid microarray (RNA, DNA, cDNA, PCR products or ESTs) according to 
the invention can be constructed as follows: 
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Nucleic acids (RNA, DNA, cDNA, PGR products or ESTs) (~40|il) are precipitated 
with 4pl (1/10 volume) of 3M sodium acetate (pH 5.2) and 100 ul (2.5 volumes) of ethanol 
and stored overnight at -20°C. They are then centrifuged at 3,300 rpm at 4°C for 1 hour. 
The obtained pellets are washed with 50|ul ice-cold 70% ethanol and centrifuged again for 30 
5 minutes. The pellets are then air-dried and resuspended well in 50% dimethylsulfoxide 
(DMSO) or 20jj1 3X SSC overnight. The samples are then deposited either singly or in 
duplicate onto Gamma Amino Propyl Silane (Corning CMT-GAPS or CMT-GAP2, Catalog 
No. 40003, 40004) or polylysine-coated slides (Sigma Cat. No. P0425) using a robotic GMS 
417 or 427 arrayer (Affymetrix, CA). The boundaries of the DNA spots on the microarray 
10 are marked with a diamond scriber. The invention provides for arrays where 10-20,000 
different DNAs are spotted onto a solid support to prepare an array, and also may include 
duplicate or triplicate DNAs. 

The arrays are rehydrated by suspending the slides over a dish of warm particle free 
ddH20 for approximately one minute (the spots will swell slightly but not run into each other) 

15 and snap-dried on a 70-80°C inverted heating block for 3 seconds. DNA is then UV 

crosslinked to the slide (Stratagene, Stratalinker, 65mJ - set display to "650" which is 650 x 
lOOjaJ) or baked at 80°C for two to four hours. The arrays are placed in a slide rack. An 
empty slide chamber is prepared and filled with the following solution: 3.0 grams of succinic 
anhydride (Aldrich) is dissolved in 189ml of l-methyl-2-pyrrolidinone (rapid addition of 

20 reagent is crucial); immediately after the last flake of succinic anhydride dissolved, 21.0ml of 
0.2 M sodium borate is mixed in and the solution is poured into the slide chamber. The slide 
rack is plunged rapidly and evenly in the slide chamber and vigorously shaken up and down 
for a few seconds, making sure the slides never leave the solution, and then mixed on an 
orbital shaker for 15-20 minutes. The slide rack is then gently plunged in 95°C ddH 2 0 for 2 

>5 minutes, followed by plunging five times in 95% ethanol. The slides are then air dried by 
allowing excess ethanol to drip onto paper towels. The arrays are then stored in the slide box 
at room temperature until use. 



Nucleic acid Arrays 

A nucleic acid array comprises any combination of the nucleic acid sequences 
generated from, or complementary to nucleic acid transcripts, or regions thereof, including 
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the species of nucleic acid transcripts present in blood . Preferably, for identifying 
biomarkers of a disease or condition of interest, one utilizes a microarray so as to minimize 
cost and time of the experiment. In one embodiment, the microarray is an EST microarray 
which includes ESTs complementary to genes expressed in blood. A microarray according to 
the invention preferably comprises between 10, 100, 500, 1000, 5000, 10,000 and 15,000 
nucleic acid members, and more preferably comprises at least 5000 nucleic acid members. 
The nucleic acid members are known or novel nucleic acid sequences described herein, or 
any combination thereof. A microarray according to the invention is used to assay for 
differential levels of species of transcripts RNA expression profilespresent in blood samples 
from healthy patients as compared to patients with a disease. 



Microarrays 

Microarrays include those arrays which encompass transcripts which are expressed in 
the individual. In one embodiment, a microarray which encompasses transcripts which are 
expressed in humans. In a preferred embodiment microarrays of the invention can be either 
cDNA based arrays or oligonucleotide based arrays. 

Oligonucleotide Arrays 

In a preferred embodiment, the oligonucleotide based microarrays of Affymetrix® are 
utilized. More particularly the Affymetrix® Human Genome U133 (HG-U133) Set, 
consisting of two GeneChip® arrays, contains almost 45,000 probe sets representing more 
than 39,000 transcripts derived from approximately 33,000 well-substantiated human genes. 
This set design uses sequences selected from GenBank®, dbEST, and RefSeq. More recently 
Affymetrix® has available the U133 Plus 2.0 GeneChip® which represents over 47,000 
transcripts. It is expected as more genes and transcripts are identified as a result of the human 
genome sequencing project, additional generations of microarrays will be developed. 

The sequence clusters were created from the UniGene database (Build 133, April 20, 
2001). They were then refined by analysis and comparison with a number of other publicly 
available databases including the Washington University EST trace repository and the 
University of California, Santa Cruz Golden Path human genome database (April 2001 
release). 
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The HG-U133A Array includes representation of the RefSeq database sequences and 
probe sets related to sequences previously represented on the Human Genome U95Av2 
Array. The HG-U133B Array contains primarily probe sets representing EST clusters. 

The U133 Plus 2.0 Array includes all probe sets represented on the GeneChip Human 
Genome U133 Set (U133A and U133B). The U133 Plus 2.0 includes an additional 6,500 
genes for analysis of over 47,000 transcripts. 

cDNA Based Arrays 

15 K ChondroChip™ - The ChondroChip™ is an EST based microarray and includes 
approximately 15,000 ESTs complementy to genes also expressed in human chondrocytes. 
Various versions of the 15K ChondroChip™ were used, depending upon the experiment in an 
effort to utilize a microarray which reduced redundancy so as to increase the percentage of 
unique genes and thus encompass representation of as much of the entire genome as 
possible. 

Controls on the ChondroChip™ - There are two types of controls used on microarrays. 
First, positive controls are genes whose expression level is invariant between different stages 
of investigation and are used to monitor: 

a) target DNA binding to the slide, 

b) quality of the spotting and binding processes of the target DNA onto the slide, 

c) quality of the RNA samples, and 

d) efficiency of the reverse transcription and fluorescent labeling of the probes. 
Second, negative controls are external controls derived from an organism unrelated to 

and therefore unlikely to cross-hybridize with the sample of interest. These are used to 
monitor for: 

a) variation in background fluorescence on the slide, and 

b) non-specific hybridization. 

There are currently 63 control spots on the ChondroChip™ consisting of: 

Type No. 

Positive Controls: 2 



Alien DNA 



12 



A. thaliana DNA 



10 



Spotting Buffer 



41 
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BloodChip™ - The "BloodChip™" can also be used. The BloodChip is a cDNA microarray 
slide with 10,368 PCR products derived from peripheral blood cell cDNA libraries as shown 
in Figure 24. 

30K BodyChip™ - The BodyChip™ is an EST based microarray which incorporates the 
5 unique cDNA clones from both the BloodChip™ and the ChondroChip™. The BodyChip™ 
includes coverage of over 30,000 genes. 

Identifying Biomarkers Useful in Accordance with the Invention 
Collection of Blood 

Blood is drawn according to the methods of standard phlebotomy. A blood sample 
10 useful according to the invention is a blood sample ranging in volume from as little as a 
drop of blood to 100ml, more preferably a blood sample is 10ml to 60 ml, even more 
preferably a blood sample is between 25ml to 40ml. A blood sample that is useful 
according to the invention is in an amount that is sufficient for the detection of one or more 
genes according to the invention. 

15 

In one embodiment, 30 mis of blood is isolated and stored on ice within a K3/EDTA 
tube. In another embodiment, one can utilize tubes for storing blood which contain 
stabilizing agents such as disclosed in US patent 6,617,170. In another embodiment the 
PAXgene™ blood RNA system iprovided by PreAnalytiX, a Qiagen/BD company may be 
20 used to collect blood. The PAXgene™ system is standardized on convenient BD 

Vacutainer™ technology. In yet another embodiment, the Tempus™ blood RNA collection 
tubes, offered by Applied Biosystems may be used. Tempus™ collection tubes provide a 
closed evacuated plastic tube containing RNA stabilizing reagent for whole blood collection, 
processing and subsequently RNA isolation. 

25 

In a preferred embodiment, RNA is isolated from said blood sample stored on ice 
within 24 hours, more preferably within 10 hours, even more preferably within 6 hours of 
collection most preferably immediately after drawing said blood, hi another preferred 
embodiment, wherein stabilizers are utilized, such as with the PAXgene™ system, RNA is 
30 isolated from said blood sample can be isolated after storage at room temperature for 2-4 
days, or isolated from a blood sample stored at 4° C for a number of weeks. 
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Isolation and Preparation of RNA 

Bl ood Samples 

In another aspect of the invention, a blood sample, as used herein, refers to a sample 
of whole blood without prior fractionation, a sample of subsets of blood cells, and a sample 
of specific types of blood cells. Accordingly, a blood sample includes, but is not limited to, 
whole blood without prior fractionation, peripheral blood leukocytes (PBL's), granulocytes, 
agranulocytes, T lymphocytes, B lymphocytes, monocytes, macrophages, eosinophils, 
neutrophils, basophils, erythrocytes, and platelets separated from whole blood. 

Whole Blood 

In one embodiment, a blood sample of the invention is whole blood without prior 
fractionation. By whole blood is meant blood which is unfractionated. Wh ole blood 
includes a drop of blood, a pinprick of blood. Whole blood also includes blood in which 
the serum or plasma is removed. Whole blood without prior fractionation can be used 
directly, or one can remove the serum or plasma and isolate RNA or mRNA from the 
remaining blood sample in accordance with methods well known in the art. The use of 
whole blood without fractionation is preferred since it avoids the costly and time-consuming 
need to separate out the cell types within the blood (Kimoto Kimoto Y (1998) Mol Gen. 
Genet 258:233-239; Chelly J et al. (1989). Proc. Nat. Acad. Set USA. 86:2617-2621; Chelly 
J et al (1988). Nature 333:858-860). In a preferred embodiment, the whole blood sample can 
have the plasma or serum removed by centrifugation, using preferably gentle centrifugation 
at 300-800xg for five to ten minutes.Jta another preferred embodiment, lysis buffer is added 
to the wholeblood sample without prior fractionation, prior to extraction of RNA. Lysis 
Buffer (1L) 0.6g EDTA; l.Og KHC0 2 , 8.2g NH 4 C1 adjusted to pH 7.4 (using NaOH). Once 
mixed with lysing buffer, the sample can be centrifuged and the cell pellet containing the 
RNA or mRNA extracted in accordance with methods known in the art (see for example 
Sambrook et al.) 

Peripheral Blood Leukocytes (PBLs) 

In another embodiment, a blood sample of the invention is a sample of peripheral 
blood leukocytes (PBLs). Whole blood without prior fractionation is obtained from a normal 
patient or from an individual diagnosed with, or suspected of having a disease or condition, 
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according to methods of phlebotomy well known in the art. PBLs are separated from the 
remainder of the blood using methods known in the art. For example, PBLs can be separated 
using a Ficoll® gradient . 

5 In another embodiment, a blood sample of the invention is a sample of granulocytes. 

In another embodiment, a blood sample of the invention is a sample of neutrophils, 
eosinophils, basophils or any combination thereof. In another embodiment, a blood sample 
of the invention is a sample of agranulocytes. In another embodiment, a blood sample of the 
invention is a sample of lympocytes, monocytes or a combination thereof. In yet another 

10 embodiment, a blood sample of the invention is a sample of T lymphocytes, B lymphocytes 
or a combination thereof 

_In one aspect, a whole blood sample without prior fractionation is obtained from a 
normal patient or from an individual diagnosed with, or suspected of having, a disease or 

15 condition according to methods of phlebotomy well known in the art. A whole blood sample 
without prior fractionation that is useful according to the invention is in an amount that is 
sufficient for the detection of one or more nucleic acid sequences according to the invention. 
In a preferred embodiment, a whole blood sample without prior fractionation is in an amount 
ranging from 1 jxl to 100ml, more preferably 10 pi to 50 ml, even more preferably 10 jllI to 

20 25ml and most preferably 10 jliI to 1 ml. 

Quantitation of RNA using Microarray Analysis 

In one embodiment of the invention, the expression levels of transcripts from 
individuals or populations of individuals having a condition, or not having a condition are 
25 measured using an array. In a preferred embodiment either a cDNA based microarray or an 
oligonucleotide based microarray are used, for example, the ChondroChip™ or the 
Affymetrix GeneChip® U133A, U133B or U133 Plus version are utilized. 

Microarray hybridization experiments utilizing the Affymetrix® GeneChip® 
platforms (U133A and U133 Plus 2.0) microarray 5 s are preferably performed in accordance 
30 with the Affymetrix® instructions. 
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Microarray hybridization experiments utilizing the ChondroChip™ are preferably 
performed as described below. 



Preparation of Fluorescent DNA Probe from mJRNA 

Fluorescently labeled target nucleic acid samples are prepared for analysis with an 
5 array of the invention. 

In one embodiment of the invention, labeled cDNA is prepared for hybridization to 
the ChondroChip™ microarray using 2[ig Oligo-dT primers annealed to 2|xg of mRNA 
isolated from a blood sample of a patient in a total volume of 15)al, by heating to 70°C for 10 
min, and cooled on ice. 

10 In another embodiment of the invention, 20 ug of total RNA can be utilized for 

preparation of labeled cDNA for purposes of hybridization. 

In another embodiment of the invention, RNA can be amplified (aRNA) from either 
total RNA or mRNA. In a preferred embodiment aRNA is made from total RNA. Total 
RNA is extracted with TRIzol as stated previously. 0.1 ~ 0.5 ug total RNA from each sample 
15 is then subjected to RNA amplification using RNA Amplification Kit (Arcturus, Catalog 

#KIT0201) following the user guide. 2.5 ug amplified RNA was then used for probe labeling 
by reverse transcription with 1 mM Cy3 or Cy5 (Pharmacia). The protocol used for 
hybridization was based on that described previously (H.Zhang 2002). 

20 The mRNA is reverse transcribed by incubating the sample at 42°C for 1.5-2 hours in 

a 100 jllI volume containing a final concentration of 50mM Tris-HCl (pH 8.3), 75mM KC1, 
3mM MgCl 2 , 25mM DTT, 25mM unlabeled dNTPs, 400 units of Superscript II (200U/|liL, 
Gibco BRL), and 15mM of Cy3 or Cy5 (Amersham). RNA is then degraded by addition of 
15(xl of 0.1N NaOH, and incubation at 70 DC for 10 min. The reaction mixture is neutralized 

25 by addition of 15|xl of 0.1N HC1, and the volume is brought to SOOjllI with TE (lOmM Tris, 
ImM EDTA), and 20 jig of Cotl human DNA (Gibco-BRL) is added. 

The labeled target nucleic acid sample is purified by centrifugation in a Centricon-30 
micro-concentrator (Amicon). If two different target nucleic acid samples (e.g., two samples 
derived from a healthy patient vs. patient with a disease) are being analyzed and compared by 
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hybridization to the same array, each target nucleic acid sample is labeled with a different 
fluorescent label (e.g., Cy3 and Cy5) and separately concentrated. The separately 
concentrated target nucleic acid samples (Cy3 and Cy5 labeled) are combined into a fresh 
centricon, washed with 500jxl TE, and concentrated again to a volume of less than 7|xl. l\xL 
5 of lO^g/^il polyA RNA (Sigma, #P9403) and lpl of lOjxg/^1 tRNA (Gibco-BRL, #15401- 
01 1) is added and the volume is adjusted to 9.5 jil with distilled water. For final target nucleic 
acid preparation 2. ljal 20XSSC (1.5M NaCl, 150mM NaCitrate (pH8.0)) and0.35|il 10%SDS 
is added. 

Hybridization 

10 Labeled nucleic acid is denatured by heating for 2 min at 100°C, and incubated at 

37°C for 20-30 min before being placed on a nucleic acid array under a 22mm x 22mm glass 
cover slip. Hybridization is carried out at 65°C for 14 to 18 hours in a custom slide chamber 
with humidity maintained by a small reservoir of 3XSSC. The array is washed by 
submersion and agitation for 2-5 min in 2X SSC with 0.1%SDS, followed by IX SSC, and 

15 0. IX SSC. Finally, the array is dried by centrifugation for 2 min in a slide rack in a Beckman 
GS-6 tabletop centrifuge in Microplus carriers at 650 RPM for 2 min. 

Signal Detection And Data Generation 

Following hybridization of an array with one or more labeled target nucleic acid 
samples, arrays are scanned immediately using a GMS Scanner 418 and Scanalyzer software 
20 (Michael Eisen, Stanford University), followed by GeneSpring™ software (Silicon Genetics, 
CA) analysis. Alternatively, a GMS Scanner 428 and Jaguar software may be used followed 
by GeneSpring™ software analysis. 

If one target nucleic acid sample is analyzed, the sample is labeled with one 
fluorescent dye (e.g., Cy3 or Cy5). 

25 After hybridization to a microarray as described herein, fluorescence intensities at the 

associated nucleic acid members on the microarray are determined from images taken with a 
custom confocal microscope equipped with laser excitation sources and interference filters 
appropriate for the Cy3 or Cy5 fluorescence. 
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The presence of Cy3 or Cy5 fluorescent dye on the microarray indicates hybridization 
of a target nucleic acid and a specific nucleic acid member on the microarray. The intensity 
of Cy3 or Cy5 fluorescence represents the amount of target nucleic acid which is hybridized 
to the nucleic acid member on the microarray, and is indicative of the expression level of the 
specific nucleic acid member sequence in the target sample. 

After hybridization, fluorescence intensities at the associated nucleic acid members on 
the microarray are determined from images taken with a custom confocal microscope 
equipped with laser excitation sources and interference filters appropriate for the Cy3 and 
Cy5 fluors. Separate scans are taken for each fluor at a resolution of 225 \xm 2 per pixel and 
65,536 gray levels. Normalization between the images is used to adjust for the different 
efficiencies in labeling and detection with the two different fluors. This is achieved by 
manual matching of the detection sensitivities to bring a set of internal control genes to nearly 
equal intensity followed by computational calculation of the residual scalar required for 
optimal intensity matching for this set of genes. 

The presence of Cy3 or Cy5 fluorescent dye on the microarray indicates hybridization 
of a target nucleic acid and a specific nucleic acid member on the microarray. The intensities 
of Cy3 or Cy5 fluorescence represent the amount of target nucleic acid which is hybridized to 
the nucleic acid member on the microarray, and is indicative of the expression level of the 
specific nucleic acid member sequence in the target sample. If a nucleic acid member on the 
array shows no color, it indicates that the element is not expressed in sufficient levels to be 
detected in either sample. If a nucleic acid member on the array shows a single color, it 
indicates that a labeled gene is expressed only in that cell sample. The appearance of both 
colors indicates that the gene is expressed in both tissue samples. The ratios of Cy3 and Cy5 
fluorescence intensities, after normalization, are indicative of differences of expression levels 
of the associated nucleic acid member sequence in the two samples for comparison. A ratio of 
expression not equal to 1.0 is used as an indication of differential gene expression. 

The array is scanned in the Cy 3 and Cy5 channels and stored as separate 16-bit TIFF 
images. The images are incorporated and analyzed using Scanalyzer™ software which 
includes a gridding process to capture the hybridization intensity data from each spot on the 
array. The fluorescence intensity and background-subtracted hybridization intensity of each 
spot is collected and a ratio of measured mean intensities of Cy5 to Cy3 is calculated. A 
linear regression approach is used for normalization and assumes that a scatter plot of the 
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measured Cy5 versus Cy3 intensities should have a slope of one. The average of the ratios is 
calculated and used to rescale the data and adjust the slope to one. A post-normalization 
cutoff of a ratio not equal to 1.0-is used to identify differentially expressed genes. 

Annotation to Identify those RNA transcripts which are differentially expressed in blood 

In one aspect of the invention, Affymetrix® GeneChip® platforms (U133A and U133 
Plus 2.0) microarrays are used. As would be understood by a person skilled in the art, each 
"gene ID" on an Affymetrix® microarray represents a number of oligonucleotide probe pairs 
corresponding to a region of transcribed ma, each probe pair consists of a matched and a 
mismatched oligonucleotide, wherein the matched oligonucleotide is 100% complementary to 
a RNA which is transcribed in humans. The mismatched oligonucleotide is less than 100% 
complementary to a region of a gene or a region of RNA which is transcribed in humans. 
Microarrays of the invention useful for identifying biomarkers for human conditions include 
the U95 array, the U133A array, the U133B array or the U133 plus 2.0 array. As would be 
understood by a person skilled in the art, the term "gene ID" can also be termed "spot 
number" or "spot ID" or "probe set ID". An example of a gene ID used by Affymetrix® is 
160020_at; 1494_f_at,; or 200003_s_at. 

Gene ID 's are annotated by Affymetrix and the results of the annotation are available 
on the Affymetrix website at www. affymetrix. com . As used herein "annotation" when used 
in the context of the Affymetrix® microarray is the information which allows one to identify 
the expressed RNA and, if applicable, the resulting protein translated by the expressed RNA 
which is being measured as a result of the binding of RNA to the probe pairs of the 
microarray. The annotation master table for the Affymetrix human microarrays is disclosed 
in Table 8A. Details as to the annotation provided in Table 8 A are shown below in Table 9. 



39 



WO 2004/112589 



PCT/US2004/020836 



Table 9 



Affymetrix 


15kChondroChip 




Probe Set ID 


ClonelD 


Affymetrix ID for the probe or 
ChondroGene's cDNA clone 
ID 


Target Description 


Target Description 


Description of the represented 
gene 


Representative Public ID 


Accession 


Genbank (or internal in the 
case of some Affy IDs) 
database identifier(s) for the 
represented gene 


Overlapping Transcripts 




Details of overlapping 

11 cHloC-X ip Lo ±UUilU. ill cL 

chromosomal region that 
aligns with a target sequence. 




Aliases 


Gene name synonyms. 


Gene Title 


Gene Title 


Name of represented gene. 


Gene Symbol 


Gene Symbol 


Official symbol of represented 
gene. 


UniGene ID 


UniGene ID 


The identifier for the UniGene 
cluster to which the 
represented gene belongs. 


Ensembl 




Ensembl database identifier 
for the represented gene. 



40 



WO 2004/112589 



PCT/US2004/020836 



LocusLink 


J^OCUSJUIIIK 


T nni^T.inlc database 
identifier(s) for the 
represented gene. 


SwissFrot 




O VY lool 1UI \JLCLLCLIJ CXij\*> 

identifier(s) for the 
represented gene. 


RefSeq Protem ID 


Keioeq xroxem lu 


database identifier(s) for the 
represented gene. 


RefSeq Transcript ID 




Transcript Reference 
Sequence database 
identifier(s) for the 
represented gene. 



In another aspect of the invention, cDNA based arrays such as the ChondroChip™ are 
used. Sequences corresponding to EST sequences are spotted onto the microarray. 
Sequences used include those previously identified using cartilage tissue library clones as 

5 outlined in H. Zhang et al. Osteoarthritis and Cartilage (2002) 10, 950-960. The 

differentially expressed EST sequences of the microarrays of the invention are annotated by 
searching against available databases, including the "nt", "nr ? \ "est", "gss" and "htg" data 
bases available through NCBI to determine putative identities for ESTs matching to known 
genes or other ESTs. Functional characterisation of ESTs with known gene matches are 

10 made according to any known method. Preferably, differentially expressed EST sequences 
are compared to the non-redundant Genbank/EMBL/DDBJ and dbEST databases using the 
BLAST algorithm (Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local 
alignment search tool. J Mol Biol 1990;215:403-10). A minimum value of P =10" 10 and 
nucleotide sequence identity >95%, where the sequence identity is non-contiguous or 

15 scattered, are required for assignments of putative identities for ESTs matching to known 

genes or to other ESTs. Construction of a non-redundant list of genes represented in the EST 
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set is done with the help of Unigene, Entrez and PubMed at the National Center for 
Biotechnology Information (NCBI) web site at www.ncbi.nlm.nih.gov. 



Genes are identified from ESTs according to known methods. To identify novel 
genes from an EST sequence, the EST should preferably be at least 100 nucleotides in length, 
and more preferably 150 nucleotides in length, for annotation. Preferably, the EST exhibits 
open reading frame characteristics (i.e., can encode a putative polypeptide). 

Having identified an EST corresponding to a larger sequence, other portions of the 
larger sequence which comprises the EST can be used in assays to elucidate gene function, 
e.g., to isolate polypeptides encoded by the gene, to generate antibodies specifically reactive 
with these polypeptides, to identify binding partners of the polypeptides (receptors, ligands, 
agonists, antagonists and the like) and/or to detect the expression of the gene (or lack thereof) 
in healthy or diseased individuals. 

In another aspect, the invention provides for nucleic acid sequences that do not 
demonstrate a "significant match" to any of the publicly known sequences in sequence 
databases at thq time a query is done. Longer genomic segments comprising these types of 
novel EST sequences can be identified by probing genomic libraries, while longer expressed 
sequences can be identified in cDNA libraries and/or by performing polymerase extension 
reactions (e.g., RACE) using EST sequences to derive primer sequences as is known in the 
art. Longer fragments can be mapped to particular chromosomes by FISH and other 
techniques and their sequences compared to known sequences in genomic and/or expressed 
sequence databases. 

The amino acid sequences encoded by the ESTs can also be used to search databases, 
such as GenBank, SWISS-PROT, EMBL database, PIR protein database, Vecbase, or 
GenPept for the amino acid sequences of the corresponding full-length genes according to 
procedures well known in the art. 

Alternative methods for analysing ESTs are also available. For example, the ESTs 
may be assembled into contigs with sequence alignment, editing, and assembly programs 
such as PHRED and PHRAP (Ewing, et al, 1998, Genome Res. 3:175, incorporated herein; 
and the web site at bozeman.genome.washington.edu). Contig redundancy is reduced by 
clustering nonoverlapping sequence contigs using the EST clone identification number, 
which is common for the nonoverlapping 5 and 3 sequence reads for a single EST cDNA 



42 



WO 2004/112589 PCT/US2004/020836 
clone. In one aspect, the consensus sequence from each cluster is compared to the non- 
redundant Genbank/EMBL/DDBJ and dbEST databases using the BLAST algorithm with the 
help of unigene, Entrez and PubMed at the NCBI site. 

EST clones used to spot onto the ChondroChip™ have been annotated using the 
5 methods described above. Results are reported by clone name and the annotation disclosed 
in the ChondroChip™ Master Annotation Table 8B. As used herein "annotation" when used 
in the context of the ChondroChip™ allows one to identify the expressed RNA and, if 
applicable, the resulting protein translated by the expressed RNA which is being measured as 
a result of the binding of RNA to ChondroChip™ microarray. The details of the annotation 
10 shown in Table 9 above. 

Measure of level of species of transcripts in Blood Using Quantitative Real Time RT- 
PCR 

In another aspect of the invention, the level of one or more species of transcripts of 
15 the invention can be determined using quantitative methods including QRT-PCR, RNA from 
blood (either whole blood without prior fractionation, , peripheral blood leukocytes, PBMCs 
or another subfraction of blood) using quantitative reverse transcription (RT) in combination 
with the polymerase chain reaction (PGR). 

20 Total RNA, or mRNA from blood is used as a template and a primer specific to the 

transcribed portion of a gene of the invention is used to initiate reverse transcription. Primer 
design can be accomplished utilizing commercially available software (e.g. Primer Designer 
1.0, Scientific Sofware etc.). The product of the reverse transcription is subsequently used as 
a template for PGR. 

25 PGR provides a method for rapidly amplifying a particular nucleic acid sequence by 

using multiple cycles of DNA replication catalyzed by a thermostable, DNA-dependent DNA 
polymerase to amplify the target sequence of interest. PCR requires the presence of a nucleic 
acid to be amplified, two single-stranded oligonucleotide primers flanking the sequence to be 
amplified, a DNA polymerase, deoxyribonucleoside triphosphates, a buffer and salts. 

30 

The method of PCR is well known in the art. PCR, is performed as described in 
Mullis and Faloona, 1987, Methods EnzymoL, 155: 335, herein incorporated by reference. 
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PCR is performed using template DNA or cDNA (at least lfg; more usefully, 1-1000 
ng) and at least 25 pmol of oligonucleotide primers. A typical reaction mixture includes: 2jil 
of DNA, 25 pmol of oligonucleotide primer, 2.5 pi of 10D PCR buffer 1 (Perkin-Elmer, 
Foster City, CA), 0.4 (xl of 1.25 yM dNTP, 0.15 yd (or 2.5 units) of Taq DNA polymerase 
5 (Perkin Elmer, Foster City, CA) and deionized water to a total volume of 25 pi. Mineral oil 
is overlaid and the PCR is performed using a programmable thermal cycler. 

The length and temperature of each step of a PCR cycle, as well as the number of 
cycles, are adjusted according to the stringency requirements in effect. Annealing 
0 temperature and timing are determined both by the efficiency with which a primer is expected 
to anneal to a template and the degree of mismatch that is to be tolerated. The ability to 
optimize the stringency of primer annealing conditions is well within the knowledge of one of 
moderate skill in the art. An annealing temperature of between 30°C and 72°C is used. 
Initial denaturation of the template molecules normally occurs at between 92°C and 99°C for 
L5 4 minutes, followed by 20-40 cycles consisting of denaturation (94-99°C for 15 seconds to 1 
minute), annealing (temperature determined as discussed above; 1-2 minutes), and extension 
(72°C for 1 minute). The final extension step is generally carried out for 4 minutes at 72°C, 
and may be followed by an indefinite (0-24 hour) step at 4°C. 

20 QRT-PCR which is quantitative in nature can also be performed, using either reverse 

transcription and PCR in a two step procedure, or reverse transcription combined with PCR 
in a single step protocol so as to provide a quantitative measure of the level of one or more 
species of RNA transcripts in blood. One of these techniques, for which there are 
commercially available kits such as Taqman® (Perkin Elmer, Foster City, CA), is performed 

25 with a transcript-specific antisense probe. This probe is specific for the PCR product (e.g. a 
nucleic acid fragment derived from a gene) and is prepared with a quencher and fluorescent 
reporter probe complexed to the 5 5 end of the oligonucleotide. Different fluorescent markers 
are attached to different reporters, allowing for measurement of two products in one reaction. 
When Taq DNA polymerase is activated, it cleaves off the fluorescent reporters of the probe 

30 bound to the template by virtue of its 5'-to-3' exonuclease activity. In the absence of the 
quenchers, the reporters now fluoresce. The color change in the reporters is proportional to 
the amount of each specific product and is measured by a fluorometer; therefore, the amount 
of each color is measured and the PCR product is quantified. The PCR reactions are 
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performed in 96 well plates so that samples derived from many individuals are processed and 
measured simultaneously. The Taqman® system has the additional advantage of not 
requiring gel electrophoresis and allows for quantification when used with a standard curve. 

5 A second technique useful for detecting PGR products quantitatively without is to use 

an intercolating dye such as the commercially available QuantiTect™ SYBR® Green PCR 
(Qiagen, Valencia California). RT-PCR is performed using SYBR® green as a fluorescent 
label which is incorporated into the PCR product during the PCR stage and produces a 
flourescense proportional to the amount of PCR product. 

10 

Both Taqman® and QuantiTect™ SYBR® systems can be used subsequent to reverse 
transcription of RNA. 

Additionally, other systems to quantitatively measure the level of one or more species 
15 of transcripts are known including Molecular Beacons® which uses a probe having a 
fluorescent molecule and a quencher molecule, the probe capable of forming a hairpin 
structure such that when in the hairpin form, the fluorescence molecule is quenched, and 
when hybridized the flourescense increases giving a quantitative measurement of one or more 
species of RNA transcripts. 

20 

Several other techniques for detecting PCR products quantitatively without 
electrophoresis may also be used according to the invention (see for example PCR Protocols, 
A Guide to Methods and Applications, Innis et al., Academic Press, Inc. N.Y., (1990)). 

25 Identification of Useful Biomarkers 

Using techniques which allow comparison as to the levels of one or more species of 
RNA transcripts in blood as described herein, one can identify useful biomarkers of a 
condition. For example one can identify those biomarkers which identify differential levels 
of one or more species of transcripts as between, for example, an individual or a population 
30 of individuals having a condition and an individual or a population of individuals not having 
a condition. 
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When comparing two or more samples for differences, results are reported as 
statistically significant when there is only a small probability that similar results would have 
been observed if the tested hypothesis (i.e., the RNA transcripts are not expressed at different 
levels) were not true. A small probability can be defined as the accepted threshold level at 
5 which the results being compared are considered significantly different. The accepted lower 
threshold is set at, but not limited to, 0.05 (i.e., there is a 5% likelihood that the results would 
be observed between two or more identical populations) such that any values determined by 
statistical means at or below this threshold are considered significant. 

When comparing two or more samples for similarities, results are reported as 
statistically significant when there is only a small probability that similar results would have 
been observed if the tested hypothesis (i.e., the genes are not expressed at different levels) 
were not true. A small probability can be defined as the accepted threshold level at which the 
results being compared are considered significantly different. The accepted lower threshold is 
set at, but not limited to, 0.05 (i.e., there is a 5% likelihood that the results would be observed 
between two or more identical populations) such that any values determined by statistical 
means above this threshold are not considered significantly different and thus similar. 

Preferably the identification of biomarkers is done using statistical analysis. For 
example, the Wilcox Mann Whitney rank sum test or a standard modified t-test such as a 
permutation t-test can be used. Additionally multigroup comparisons can also be done when 
there are three or more reference populations. In this case one can use statistical tests such as 
AN OVA or Kruskal Wallis which can then be analyzed using a post-hoc pairwise test such 
as the t-test, the Tukey test, or the student-Newman-Keuls test. Other multiclass comparison 
tests can also be used as would be understood by a person skilled in the art. See for example 
(Sokal and Rohlf (1987) Introduction to Biostatistics 2 nd edition, WH Freeman, New York), 
Yeung and Bumgarner, Multiclass classification of microarray data with repeated 
measurements: application to cancer Genome Biology 2003, 4:R83; Breiman, L. (2001) 
Statistical Modeling, the two cultures Statistical Science 16(3) 199-231 which are 
incorporated herein in their entirety. 

In order to facilitate ready access, e.g. for comparison, review, recovery and/or 
30 modification, the expression profiles of patients with a condition or without a condition can 
be recorded in a database, whether in a relational database accessible by a computational 
device or other format, or a manually accessible indexed file of profiles as photographs, 
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analogue or digital imaging, readouts spreadsheets etc. Typically the database is compiled 
and maintained at a central facility, with access being available locally and/or remotely. 



As would be understood by a person skilled in the art, comparison as between the the 
level of one or more species of transcripts in blood as illustrated by an expression profileof a 

5 test individual suspected of having a condition of interest, with that of individuals with 

thecondition of interest, as well as an analogous comparison of expression profiles between 
individuals with a certain stage or degree of progression of a disease condition, without said 
condition, or a healthy ("normal") individual ,so as to diagnose or prognose said test 
individual can occur via expression profiles generated concurrently or non concurrently. It 

10 would be understood that a database would be useful to generate said comparison. 

As additional test samples from test patients are obtained, through clinical trials, 
further investigation, or the like, additional data can be determined in accordance with the 
methods disclosed herein and can likewise be added to a database to provide better reference 
data for comparison of healthy and/or non-disease patients and/or certain stage or degree of 
15 progression of a disease as compared with the test patient sample. 

The ability to combine biomarkers provides an even greater potential to help 
distinguish as between two populations so as to allow diagnosis of a disease or condition. In 
order to identify useful combitions of biomarkers, each potential combination or set of 
biomarkers are evaluated for their ability to diagnose an unknown as having or not having a 
20 specific condition. 

The diagnosing or pro gno sing may thus be performed by detecting the expression 
level of one gene, two or more genes, three or more genes, four or more genes, five or more 
genes, six or more genes, seven or more genes, eight or more genes, nine or more genes, ten 
or more genes, fifteen or more genes, twenty or more genes thirty or more genes, fifty or 
25 more genes, one hundred or more genes, two hundred or more genes, three hundred or more 
genes, five hundred or more genes or all of the genes disclosed for the specific condition in 
question. 

Use of Expression Profiles for Diagnostic Purposes 
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As would be understood to a person skilled in the art, one can utilize sets of 
biomarkers which have been identified as statistically significant as described above in order 
to characterize an unknown sample as having said disease or not having said disease. This is 
commonly termed "class prediction". 

5 Methods that can be used for class prediction analysis have been well described and 

generally involve a training phase using samples with known classification and a testing 
phase from which the algorithm generalizes from the training data so as to predict 
classification of unknown samples (see for Example Slonim, D. (2002), Nature (Genetics 
Supp. Vol 32 502-8, Raychaudhuri et al. (2001) Trends Biotechnol 19: 189-193; Khan et al. 
10 (2001) Nature Med. 7 673-9.; Golub et al. (1999) Science 286: 531-7. Hastie et al. (2000) 
Genome Biol. 1(2) Research 0003.1-0003.21 all of which are incorporated herein by 
reference in their entirety ). 

Use of Expression Profiles to Predict Disease State 

15 One can also utilize sets of genes which have been identified as producing differential 

levels of transcripts in blood which are statistically significant as described above in order to 
predict whether an asymptomatic individual will develop symptoms of said condition or 
whether an individual with an early stage of a disease condition will develop a later stage of a 
disease condition. 

20 For example, as a result of analyzing over 780 individuals, we have surprisingly 

shown that almost all individuals in the 56 and over age group have either moderate, marked 
or severe OA, and furthermore that almost all individuals in the 61 and over age group have 
either marked or severe OA only (see Figure 35 and Table 3AE) even though there remain 
approximately 50% of Canadians over the age of 65 who do not show symptoms of 

25 osteoarthritis (Statistics Canada, Canadian Community Health Survey, 2000/2001). This data 
indicates that individuals with mild OA have a significantly increased chance of progressing 
to marked or severe OA as compared with individuals who do not have mild OA. 
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As a result, one can utilize the methods of class prediction analysis described herein 
in order to determine whether an individual will develop late stage OA by identifying 
individuals with early stages of OA. 

As additional samples are obtained, for example during clinical trials, their expression 
5 profiles can be determined and correlated with the relevant subject data in the database and 
likewise be recorded in said database. Algorithms as described above can be used to query 
additional samples against the existing database to further refine the predictive determination 
by allowing an even greater association between the prediction of OA and one or more 
species of RNA transcripts signature. 

10 The prediction of late stage OA may thus be performed by detecting the level of 

transcripts expressed by two or more genes, three or more genes, four or more genes, five or 
more genes, six or more genes, seven or more genes, eight or more genes, nine or more 
genes, ten or more genes, fifteen or more genes, twenty or more genes thirty or more genes, 
fifty or more genes, one hundred or more genes, two hundred or more genes, three hundred or 

15 more genes, five hundred or more genes or all of the genes disclosed for identifying mild OA. 
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Table 1 shows genes that are differentially expressed in blood samples from patients with a 
disease or patients who are co-morbid as compared to blood samples from healthy patients or 
5 patients without said disease, or with only one of said co-morbid diseases 

Table 1A shows the identity of those genes that are differentially expressed in blood 
samples from patients with osteoarthritis and hypertension as compared with normal 
patients using the ChondroChip™ platform. 

Table IB shows the identity of those genes that are differentially expressed in blood 
10 samples from patients with osteoarthritis and obesity as compared with normal patients 
using the ChondroChip™ platform. 

Table 1C shows the identity of those genes that are differentially expressed in blood 
samples from patients with osteoarthritis and allergies as compared with normal patients 
using the ChondroChip™ platform. 

15 Table ID shows the identity of those genes that are differentially expressed in blood 

samples from patients with osteoarthritis and subject to systemic steroids as compared with 
normal patients using the ChondroChip™ platform. 

Table IE shows the identity of those genes that are differentially expressed in blood 
samples from patients with hypertension as compared to non hypertension patients using 
20 the ChondroChip™ platform. 

Table IF shows the identity of those genes that are differentially expressed in blood 
samples from patients obesity as compared to non obese patients using the ChondroChip™ 
platform. 

Table 1G shows the identity of those genes that are differentially expressed in blood samples 
25 from patients with hypertension and OA when compared with patients who have OA only 
wherein genes identified in Table 1 A have been removed so as to identify genes which are 
unique to hypertension. 
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Table 1H shows the identity of those genes which were identified in Table 1 A which are 
shared with those genes differentially expressed in blood samples from patients with 
hypertension and OA when compared with patients who have OA only. 

Table II shows the identity of those genes that are differentially expressed in blood samples 
5 from patients who are obese and have OA when compared with patients who have OA only 
and wherein genes identified in Table IB have been removed so as to identify genes which 
are unique to obesity. 

Table 1 J shows the identify of those genes identified in Table IB which are shared with 
those genes differentially expressed in blood samples from patients who are obese and have 
10 OA when compared with patients who have OA. 

Table IK shows the identity of those genes that are differentially expressed in blood samples 
from patients with allergies and OA when compared with patients who have OA only 
wherein genes identified in Table 1C have been removed so as to identify genes which are 
unique to allergies. 

15 Table 1L shows thq identify of those genes identified in Table 3C which are shared with 
those genes differentially expressed in blood samples from patients with allergies and OA 
when compared with patients who have OA only. 

Table 1M shows the identity of those genes that are differentially expressed in blood samples 
from patients who are on systemic steroids and have OA when compared with patients who 
20 have OA only wherein genes identified in Table ID have been removed so as to identify 
genes which are unique to patients on systemic steroids. 

Table IN shows the identify of those genes identified in Table ID which are shared with 
those genes differentially expressed in blood samples from patients who are on systemic 
steroids and have OA when compared with patients who have OA only. 

25 Table lO shows the identity of those genes that are differentially expressed in blood from 
patients taking either birth control, prednisone or hormone replacement therapy and 
presenting with OA using the ChondroChip™ platform. 
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Table IP shows the identity of those genes that are differentially expressed in blood 
samples from patients with type II diabetes as compared to patients without type II diabetes 
using the ChondroChip™ platform. 

Table 1Q shows the identity of those genes that are differentially expressed in blood 
5 samples from patients with Hyperlipidemia as compared to patients without Hyperlipidemia 
using the ChondroChip™ platform. 

Table 1R shows the identity of those genes that are differentially expressed in blood 
samples from patients with lung disease as compared to patients without lung disease using 
the ChondroChip™ platform. 

10 Table IS shows the identity of those genes that are differentially expressed in blood 

samples from patients with bladder cancer as compared to patients without bladder cancer 
using the ChondroChip™ platform. 

Table IT shows the identity of those genes that are differentially expressed in blood 
samples from patients with early stage bladder cancer, late stage bladder cancer or non- 
15 bladder cancer using the ChondroChip™ platform. 

Table 1U shows the identity of those genes that are differentially expressed in blood 
samples from patients with coronary artery disease (CAD) as compared to patients not 
having CAD using the ChondroChip™ platform. 

Table IV shows the identity of those genes that are differentially expressed in blood 
20 samples from patients with rheumatoid arthritis as compared to patients not having 
rheumatoid arthritis using the ChondroChip™ platform . 

Table 1W shows the identity of those genes that are differentially expressed in blood 
samples from patients with rheumatoid arthritis as compared to patients not having 
rheumatoid arthritis using the Affymetrix® platform . 

25 Table IX shows the identity of those genes that are differentially expressed in blood 
samples from patients with depression as compared with patients not having depression 
using the ChondroChip™ platform. 
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Table 1Y shows the identity of those genes that are differentially expressed in blood 
samples from patients with various stages of osteoarthritis using the ChondroChip™ 
platform. 

Table 1Z shows the identity of those genes that are differentially expressed in blood 
5 samples from patients with liver cancer as compared with patients not having liver cancer 
using the Affymetrix® platform. 

Table 1AA shows the identity of those genes that are differentially expressed in blood 
samples from patients with schizophrenia as compared with patients not having 
schizophrenia using the Affymetrix® platform. 

10 Table 1 AB shows the identity of those genes that are differentially expressed in blood 
samples from patients with Chagas disease as compared with patients not having Chagas 
disease using the Affymetrix® platform. 

Table 1 AC shows the identity of those genes that are differentially expressed in blood 
samples from patients with asthma as compared with patients not having asthma using the 
15 ChondroChip™. 

Table IAD shows the identity of those genes that are differentially expressed in blood 
samples from patients with asthma as compared with patients not having asthma using the 
Affymetrix® platform. 

Table 1AE shows the identity of those genes that are differentially expressed in blood 
20 samples from patients with lung cancer as compared with patients not having lung cancer 
using the Affymetrix® platform. 

Table 1AG shows the identity of those genes that are differentially expressed in blood 
samples from patients with hypertension as compared with patients not having hypertension 
using the Affymetrix® platform. 

25 Table 1 AH shows the identity of those genes that are differentially expressed in blood 

samples from patients with obesity as compared with patients not having obesity using the 
Affymetrix® platform. 
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Table 1AI shows the identity of those genes that are differentially expressed in blood 
samples from patients with ankylosing spondylitis using the Affymetrix® platform. 

Table 2 shows the identity of those genes that are differentially expressed in blood from 
patients with either mild or severe OA, but for which genes relevant to asthma, obesity, 
5 hypertension, systemic steroids and allergies have been removed. 

Table 3 shows those genes that are differentially expressed in blood samples from patients 
with a first disease as compared to blood samples from patients with a second disease so as 
to allow differential diagnosis as between said first and second disease. 

Table 3 A shows the identity of those genes that are differentially expressed in blood from 
10 patients with schizophrenia as compared with manic depression syndrome (MDS) using the 
Affymetrix® platform. 

Table 3B shows the identity of those genes that are differentially expressed in blood from 
patients with hepatitis as compared with liver cancer using the Affymetrix® platform. 

/ 

Table 3C shows the identity of those genes that are differentially expressed in blood from 
15 patients with bladder cancer as compared with liver cancer using the Affymetrix® platform. 

Table 3D shows the identity of those genes that are differentially expressed in blood from 
patients with bladder cancer as compared with testicular cancer using the Affymetrix® 
platform. 

Table 3E shows the identity of those genes that are differentially expressed in blood from 
20 patients with testicular cancer as compared with kidney cancer using the Affymetrix® 
platform. 

Table 3F shows the identity of those genes that are differentially expressed in blood from 
patients with liver cancer as compared with stomach cancer using the Affymetrix® 
platform. 

25 Table 3G shows the identity of those genes that are differentially expressed in blood from 
patients with liver cancer as compared with colon cancer using the Affymetrix® platform. 
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Table 3H shows the identity of those genes that are differentially expressed in blood from 
patients with stomach cancer as compared with colon cancer using the Affymetrix® 
platform. 

Table 31 shows the identity of those genes that are differentially expressed in blood from 
5 patients with Rheumatoid Arthritis as compared with Osteoarthritis using the Affymetrix® 
platform. 

Table 3K shows the identity of those genes that are differentially expressed in blood from 
patients with Chagas Disease as compared with Heart Failure using the Affymetrix® 
platform. 

10 Table 3L shows the identity of those genes that are differentially expressed in blood from 
patients with Chagas Disease as compared with Coronary Artery Disease using the 
Affymetrix® platform. 

Table 3N shows the identity of those genes that are differentially expressed in blood from 
patients with Coronary Artery Disease as compared with Heart Failure using the 
15 Affymetrix® platform. 

Table 3P shows the identity of those genes that are differentially expressed in blood from 
patients with Asymptomatic Chagas Disease as compared with Symptomatic Chagas 
Disease using the Affymetrix® platform. 

Table 3Q shows the identity of those genes that are differentially expressed in blood from 
20 patients with Alzheimer's' as compared with Schizophrenia using the Affymetrix® 
platform. 

Table 3R shows the identity of those genes that are differentially expressed in blood from 
patients with Alzheimer's' as compared with Manic Depression Syndrome using the 
Affymetrix® platform. 

25 Table 4 shows those genes that are differentially expressed in blood samples from patients 
with a stage of Osteoarthritis as compared to blood samples from patients with a second 
stage of Osteoarthritis so as to allow monitoring of progression and/or regression of 
disease. 
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Table 4 A shows the identity of those genes that are differentially expressed in blood from 
patients with Osteoarthritis as compared with patients without Osteoarthritis using the 
ChondroChip™ platform. 

Table 4B shows the identity of those genes that are differentially expressed in blood from 
patients with Osteoarthritis as compared with patients without Osteoarthritis using the 
Affymetrix® platform. 

Table 4C shows the identity of those genes that are differentially expressed in blood from 
patients with mold Osteoarthritis as compared with patients without mild Osteoarthritis 
using the ChondroChip™ platform. 

Table 4D shows the identity of those genes that are differentially expressed in blood from 
patients with mild Osteoarthritis as compared with patients without Osteoarthritis using the 
Affymetrix® platform. 

Table 4E shows the identity of those genes that are differentially expressed in blood from 
patients with moderate Osteoarthritis as compared with patients without Osteoarthritis 
using the ChondroChip™ platform. 

Table 4F shows the identity of those genes that are differentially expressed in blood from 
patients with moderate Osteoarthritis as compared with patients without Osteoarthritis 
using the Affymetrix® platform. 

Table 4G shows the identity of those genes that are differentially expressed in blood from 
patients with marked Osteoarthritis as compared with patients without Osteoarthritis using 
the ChondroChip™ platform. 

Table 4H shows the identity of those genes that are differentially expressed in blood from 
patients with marked Osteoarthritis as compared with patients without Osteoarthritis using 
the Affymetrix® platform. 

Table 41 shows the identity of those genes that are differentially expressed in blood from 
patients with severe Osteoarthritis as compared with patients without Osteoarthritis using 
the ChondroChip™ platform. 
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Table 4 J shows the identity of those genes that are differentially expressed in blood from 
patients with severe Osteoarthritis as compared with patients without Osteoarthritis using 
the Affymetrix® platform. 

Table 4K shows the identity of those genes that are differentially expressed in blood from 
5 patients with mild Osteoarthritis as compared with patients with moderate Osteoarthritis 
using the ChondroChip™ platform. 

Table 4L shows the identity of those genes that are differentially expressed in blood from 
patients with mild Osteoarthritis as compared with patients with moderate Osteoarthritis 
using the Affymetrix® platform. 

10 Table 4M shows the identity of those genes that are differentially expressed in blood from 
patients with mild Osteoarthritis as compared with patients with marked Osteoarthritis 
using the ChondroChip™ platform. 

Table 4N shows the identity of those genes that are differentially expressed in blood from 
patients with mild Osteoarthritis as compared with patients with marked Osteoarthritis 
15 using the Affymetrix® platform. 

Table 40 shows the identity of those genes that are differentially expressed in blood from 
patients with mild Osteoarthritis as compared with patients with severe Osteoarthritis using 
the ChondroChip™ platform. 

Table 4P shows the identity of those genes that are differentially expressed in blood from 
20 patients with mild Osteoarthritis as compared with patients with severe Osteoarthritis using 
the Affymetrix® platform. 

Table 4Q shows the identity of those genes that are differentially expressed in blood from 
patients with moderate Osteoarthritis as compared with patients with marked Osteoarthritis 
using the ChondroChip™ platform. 

25 Table 4R shows the identity of those genes that are differentially expressed in blood from 
patients with moderate Osteoarthritis as compared with patients with marked Osteoarthritis 
using the Affymetrix® platform. 
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Table 4S shows the identity of those genes that are differentially expressed in blood from 
patients with moderate Osteoarthritis as compared with patients with severe Osteoarthritis 
using the ChondroChip™ platform. 

Table 4T shows the identity of those genes that are differentially expressed in blood from 
5 patients with moderate Osteoarthritis as compared with patients with severe Osteoarthritis 
using the Affymetrix® platform. 

Table 4U shows the identity of those genes that are differentially expressed in blood from 
patients with marked Osteoarthritis as compared with patients with severe Osteoarthritis 
using the ChondroChip™ platform. 

10 Table 4V shows the identity of those genes that are differentially expressed in blood from 
patients with marked Osteoarthritis as compared with patients with severe Osteoarthritis 
using the Affymetrix® platform. 

Table 5 shows those genes that are differentially expressed in blood samples from patients 
with a disease or condition of interest as compared to blood samples from patients without 
15 said disease or condition. 

Table 5A shows the identity of those genes that are differentially expressed in blood 
samples from patients with psoriasis as compared with patients not having hypertension 
using the Affymetrix® platform. 

Table 5B shows the identity of those genes that are differentially expressed in blood 
20 samples from patients with thyroid disorder as compared with patients not having thyroid 
disorder using the Affymetrix® platform. 

Table 5C shows the identity of those genes that are differentially expressed in blood 
samples from patients with irritable bowel syndrome as compared with patients not having 
irritable bowel syndrome using the Affymetrix® platform. 

25 Table 5D shows the identity of those genes that are differentially expressed in blood 

samples from patients with osteoporosis as compared with patients not having osteoporosis 
using the Affymetrix® platform. 
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Table 5E shows the identity of those genes that are differentially expressed in blood 
samples from patients with migraine headaches as compared with patients not having 
migraine headaches using the Affymetrix® platform. 

Table 5F shows the identity of those genes that are differentially expressed in blood 
samples from patients with eczema as compared with patients not having eczema using the 
Affymetrix® platform. 

Table 5G shows the identity of those genes that are differentially expressed in blood 
samples from patients with NASH as compared with patients not having NASH using the 
Affymetrix® platform. 

Table 5H shows the identity of those genes that are differentially expressed in blood 
samples from patients with alzheimers' disease as compared with patients not having 
alzheimer's disease using the Affymetrix® platform. 

Table 51 shows the identity of those genes that are differentially expressed in blood 
samples from patients with Manic Depression Syndrome as compared with patients not 
having Manic Depression Syndrome using the Affymetrix® platform. 

Table 5J shows the identity of those genes that are differentially expressed in blood 
samples from patients with Crohn's Colitis as compared with patients not having Crohn's 
Colitis using the Affymetrix® platform. 

Table 5K shows the identity of those genes that are differentially expressed in blood 
samples from patients with Chronis Cholecystits as compared with patients not having 
Chronis Cholecystits using the Affymetrix® platform. 

Table 5L shows the identity of those genes that are differentially expressed in blood 
samples from patients with Heart Failure as compared with patients not having Heart 
Failure using the Affymetrix® platform. 

Table 5M shows the identity of those genes that are differentially expressed in blood 
samples from patients with Cervical Cancer as compared with patients not having Cervical 
Cancer using the Affymetrix® platform. 



61 



WO 2004/112589 PCT/US2004/020836 
Table 5N shows the identity of those genes that are differentially expressed in blood 
samples from patients with Stomach Cancer as compared with patients not having Stomach 
Cancer using the Affymetrix® platform. 

Table SO shows the identity of those genes that are differentially expressed in blood 
5 samples from patients with Kidney Cancer as compared with patients not having Kidney 
Cancer using the Affymetrix® platform. 

Table 5P shows the identity of those genes that are differentially expressed in blood 
samples from patients with Testicular Cancer as compared with patients not having 
Testicular Cancer using the Affymetrix® platform. 

10 Table 5Q shows the identity of those genes that are differentially expressed in blood 
samples from patients with Colon Cancer as compared with patients not having Colon 
Cancer using the Affymetrix® platform. 

Table 5R shows the identity of those genes that are differentially expressed in blood 
samples from patients with Hepatitis B as compared with patients not having Hepatitis B 
15 using the Affymetrix® platform. 

Table 5S shows the identity of those genes that are differentially expressed in blood 
samples from patients with Pancreatic Cancer as compared with patients not having 
Pancreatic Cancer using the Affymetrix® platform. 

Table 5T shows the identity of those genes that are differentially expressed in blood 
20 samples from patients with Asymptomatic Chagas as compared with patients not having 
Chagas using the Affymetrix® platform. 

Table 5U shows the identity of those genes that are differentially expressed in blood 
samples from patients with Symptomatic Chagas as compared with patients not having 
Chagas using the Affymetrix® platform. 

25 Table 5V shows the identity of those genes that are differentially expressed in blood 

samples from patients with Bladder Cancer as compared with patients not having Bladder 
Cancer using the Affymetrix® platform. 
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Table 6 shows those genes that are differentially expressed in blood samples from patients 
with any one of a series of related conditions as compared to blood samples from patients 
without said related conditions. 

Table 6A shows the identity of those genes that are differentially expressed in blood 
5 samples from patients with Cancer as compared with patients not having Cancer using the 
Affymetrix® platform. 

Table 6B shows the identity of those genes that are differentially expressed in blood 
samples from patients with Cardiovascular Disease as compared with patients not having a 
Cardiovascular Disease using the Affymetrix® platform. 

10 Table 6C shows the identity of those genes that are differentially expressed in blood 

samples from patients with a Neurological Disease as compared with patients not having a 
Neurological Disease using the Affymetrix® platform. 

Table 7 shows those genes that are differentially expressed in blood samples from with a 
condition wherein said condition is a treatment as compared to blood samples from patients 
15 without said condition. 

Table 7A shows the identity of those genes that are differentially expressed in blood 
samples from patients taking Celebrex® as compared with patients on a Cox Inhibitor 
which was not Celebrex® using the ChondroChip™ platform. 

Table 7B shows the identity of those genes that are differentially expressed in blood 
20 samples from patients taking Celebrex® as compared with patients not on Celebrex® using 
the ChondroChip™ platform. 

Table 7C shows the identity of those genes that are differentially expressed in blood 
samples from patients taking Vioxx® as compared with patients not on Vioxx® using the 
ChondroChip™ platform. 

25 Table 7D shows the identity of those genes that are differentially expressed in blood 

samples from patients taking Vioxx® as compared with patients on a Cox inhibitor but not 
on Vioxx® using the ChondroChip™ platform. 
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Table 7E shows the identity of those genes that are differentially expressed in blood 
samples from patients taking NSAIDS as compared with patients not on NSABDS using the 
ChondroChip™ platform. 

Table 7F shows the identity of those genes that are differentially expressed in blood 
5 samples from patients taking Cortisone as compared with patients not on Cortisone using 
the ChondroChip™ platform. 

Table 7G shows the identity of those genes that are differentially expressed in blood 
samples from patients taking Visco Supplement as compared with patients not on Visco 
Supplement using the ChondroChip™ platform. 

10 Table 7H shows the identity of those genes that are differentially expressed in blood 

samples from patients taking Lipitor® as compared with patients not on Lipitor® using the 
ChondroChip™ platform. 

Table 71 shows the identity of those genes that are differentially expressed in blood 
samples from patients who are smokers as compared with patients who are not smokers 
1 5 using the ChondroChip™ platform 

Table 8A is an annotation table showing the relationship between the gene ID identified in 
Tables 1-7 wherein the data was generated using the Affymetrix® platform and gene 
identified by the Affymetrix probe. 

Table 8B is an annotation table showing the relationship between the clone ID identified in 
20 Tables 1-7 wherein the data was generated using the ChondroChip™ platform and the gene 
identified by the EST clones. 

Table 9 shows the descriptions as to the various annotations provided for both the 
ChondroChip™ and the Affymetrix® microarray results. 

Table 10 shows how the incidence of different stages of OA varies with respect to age in 
25 males and females 

Table 11 shows 223 EST sequences of Tables 1 A- 71 with "no-significant match" to known 
gene sequence in Patent-In Format. 
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Table 12 shows a list of genes showing greater than two fold differential expression in 
CAD peripheral blood cells relative to that of normal blood cells. 



The following examples are given for the purpose of illustrating various 
embodiments of the invention and are not meant to limit the present invention in any 
fashion. 



EXAMPLE 1 

Blood cDNA chip Microarray Data Analysis of RNA expression profiles of blood 
samples from individuals having coronary artery disease as compared with RNA 
expression profiles from normal individuals. 

A microarray was constructed using cDNA clones from a human peripheral blood 
cell cDNA library, as described herein. A total of 10,368 polymerase chain reaction (PGR) 
products of the clones from the human peripheral blood cell cDNA library described herein 
were arrayed using GNS 417 arrayer (Affymetrix). RNA for microarray analysis was 
isolated from whole blood samples without prior fractionation, obtained from three male 
and one female patients with coronary heart disease (80 - 90% stenosis) receiving vascular 
extension drugs and awaiting bypass surgery, and three healthy male controls. 

A method of high-fidelity mRNA amplification from lpg of total RNA sample was 
used. Cy5- or Cy3-dUTP was incorporated into cDNA probes by reverse transcription of 
anti-sense RNA, primed by oligo-dT. Labeled probes were purified and concentrated to the 
desired volume. Pre- hybridization and hybridization were performed following Hegde's 
protocol (Hegde P et al., A concise guide to cDNA microarray analysis. Biotechniques 
2000;29: 548 - 56). After overnight hybridization and washing, hybridization signals were 
detected with a GMS 418 scanner at 635-nm (Cy5) and 532-nm (Cy3) wave lengths (see 
Figure 17). Two RNA pools were labeled alternatively with Cy5- and Cy3-dUTP, and each 
experiment was repeated twice. Cluster analysis using GeneSpring™ 4.1.5 (Silicon 
Genetics) revealed two distinct groups consisting of four CAD and three normal control 
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samples. Two images scanned at different wavelengths were super- imposed. Individual 
spots were identified on a customized grid. Of 10,368 spots, 10,012 (96.6%) were selected 
after the removal of spots with irregular shapes. Data quality was assessed with values of 
ChlGTB2 and Ch2GTB2 provided by ScanAlyze. Only spots with CWGTB2 and 
5 Ch2GTB2 over 0.50 were selected. After evaluation of signal intensities, 8750 (84.4%) 
spots were left. Signal intensities were normalized using a scatter-plot of the signal 
intensities of the two channels. After normalization, the expression ratios of B-actin were 
1.00 + 0 21 1 11 + 0.22, 1.14 + 0.20 and 1.30 + 0.18 (24 samples of 6-actin were spotted on 
this slide as the positive control) in the four images. Differential expression of RNA was 

10 assessed as the ratio of two wave-length signal intensities. Spots showing a differential 
expression more than twofold relative to normal in all four experiments were identified as 
peripheral blood cell, differentially expressed candidate genes in CAD. 108 genes are 
differentially expressed in CAD peripheral blood cells. 43 genes are downregulated in 
CAD blood cells and 65 are upregulated (see Table 12). Functional characterization of 

15 these genes from which the differentially expressed RNA transcripts were transcribed 
shows that differential expression at the level of RNA transcription takes place in every 
gene functional category, indicating that profound changes occur in peripheral blood cells 
from patients with CAD. 

The differential expression of RNA transcribed from three genes, pro-platelet basic 
20 protein (PBP), platelet factor 4 (PF4) and coagulation factor XIII Al (F13A), initially 
identified in the microarray data analysis, was further examined by reverse transcriptase- 
PGR (RT-PCR) using the Titan One-tube RT-PCR kit (Boehringer Mannheim). Reaction 
solution contains 0.2 mM each dNTP, 5 mM DTT, 1.5 mM MgCl 0.1 pg of total RNA 
from each sample and 20 pmol each of left and right primers of PBP (5 - 
25 GGTGCTGCTGCTTCTGTC AT-3 1 (SEQ ID NO: 224) and S'-GGCAGATTTT 

CCTCCCATCC-3 1 ), (SEQ ID NO:225), F13A (5 , -AGTCCACCGTGCTAACCATC-3 ? 
(SEQ ID NO:226), and 5'-AGGGAGTCACTGCTCATGCT-3') (SEQ ID NO:227), and 
PF4 (5' GTTGCTGCTCCTGCCACTT 3' (SEQ ID NO:228), and 5' 
GTGGCTATCAGTTGGGCAGT-3')(SEQ ID NO:229). RT-PCR steps are as follows: 1. 
10 reverse-transcription: 30 min at 60 °C; 2. PGR: 2 min at 94 °C, followed by 30 - 35 cycles 
(as optimized for each gene) for 30 s at 94 °C, 30 s at optimized annealing temperature and 
2 min at 68 °C; 3. final extension: 7 min at 68 °C. PCR products were electrophoresed on 
1.5% agarose gels. Human (DlJ -actin primers (5 '-GCGAGAAGATGACCC AGATC AT-3 ' 
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(SEQ ID NO:230) and 5 -GCTC AGGAGGAGC AATGATCTT-3 (SEQ ID NO:231) were 
used as the internal control. The RT-PCR analysis confirmed that the expression of the 
three secreted proteins: PBP, PF4 and F13A were all upregulated in CAD blood cells (see 
Figures 27 and 17) 



TABLE 12 



Upregulated gene in CAD 
REV3-like, catalytic subunit 
of DNA polymerase zeta 
TGFB1 -induced anti- 
apoptotic factor 1 
A disintegrin and 
metalloproteinase domain 10 
Centaurin, delta 2 
Chloride intracellular 
channel 4 

Endothelin receptor typeA 
Glutamate receptor, 
ionotropic 

Mitogen-activated protein 
kinase 7 

Mitogen-activated protein 
kinase kinase kinase 7 
Myristoylated alanine-rich 
protein kinase C substrate 
NEVLA-related kinase 7 
PAK2 

Phospholipid scramblase 1 
Serum deprivation response 



Accession 
number 

AF035537 

D86970 

AA044656 

AA351412 
AA411940 

D90348 
N33821 

L38486 

AB009356 

D10522 

AA093324 
AA262968 
AA054476 
Z30112 



Fold 

(averag 
e) 



2.3 



2.2 



Functional 
category 



Protein 
Accession 
Number 



Cell cycle NP_002903 



Cell cycle NP_5 10880 



2.7 Cell signaling NP J)01 101 



2 Cell signaling 

2.2 Cell signaling 

2.1 Cell signaling 

2.4 Cell signaling 

3.7 Cell signaling 

4.5 Cell signaling 



NP_631920 
NPJ339234 

NP_001948 
NP_777567 

NPJ)02395 

NP 663306 



2.5 Cell signaling NP_002347 



3.5 Cell signaling 

3.5 Cell signaling 

3.3 Cell signaling 

4.5 Cell signaling 



NP_598001 

Q13177 
NP_066928 
NP 004648 
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Adducin 3 


AA029158 


2.9 


Cell structure 


NP_063968 


Desmin 


AF167579 


4.4 


Cell structure 


NP_001918 


Fibromodulin 


W23613 


2.9 


Cell structure 


NP_002014 


Laminin, beta 2 


S77512 


2.2 


Cell structure 


NP_002283 


Laminin, beta 3 


L25541 


2.4 


Cell structure 


NP_000219 


Osteonectin 


Y00755 


3.1 


Cell structure 


NP_003109 


CD59 antigen pi 8-20 


W01111 


2.4 


Cell/organism 


NP_000602 








defense 




Clusterin 


M64722 


3.5 


Cell/organism 


NP_001822 








defense 




F13A 


M14539 


2.1 


Cell/organism 


NP_000120 








defense 




Defensin, alpha 1 


M26602 


4.2 


Cell/organism 


NP_004075 








defense 




PF4 


M25897 


2.1 


Cell/organism 


NP_002610 








defense 




PBP 


M54995 


5.5 


Cell/organism 


NP_002695 








defense 




E2F transcription factor 3 


D38550 


2.1 


Gene expression 


NP__001940 


Early growth response 1 


M62829 


2.7 


Gene expression 


NP001955 


Eukaryotic translation 


N86030 


2.3 


Gene expression 


NP_001393 


elongation factor 1 alpha 1 










Eukaryotic translation 


Ml 5353 


2.1 


Gene expression 


NP_0O1959 


initiation factor 4E 










F-box and WD-40 domain 


AB014596 


2.7 


Gene expression 


NP_387449 


protein id 










Makorin, ring finger protein, 
2 


AA331966 


2.1 


Gene expression 


NP_J)54879 


Non-canonical ubiquitin- 


N92776 


2.5 


Gene expression 


NP_057420 


conjugating enzyme 1 










Nuclear receptor subfamily 


Z30425 


4.7 


Gene expression 


NP_005113 


1, group I, member 3 










Ring finger protein 1 1 


T08927 


3 


Gene expression 


NP_055187 
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Transducin-like enhancer of 
split 1 

Alkaline phosphatase, 
liver/bone/kidney 
Annexin A3 
Branched chain 
aminotransferase 1, cytosolic 
Cytochrome b 
Glutaminase 
Lysophospholipase I 
NADH dehydrogenase 1, 
subcomplex unknown 1, 6 
kDa 

Phosphofructokinase 
Ubiquinol-cytochrome c 
reductase binding protein 
CGI-1 10 protein 
Dactylidin 

Deleted in split-hand/split- 
foot 1 region 
Follistatin-like 1 
FUS-interacting protein 1 
Hypothetical protein 
FLJ12619 

Hypothetical protein from 
EUROIMAGE 588495 
Hypothetical protein 
LOC51315 
KIAA1705 protein 
Mesoderm induction early 
response 1 

Phosphodiesterase 4D- 
interacting protein 



M99435 

AB011406 

M63310 
AA336265 

AF042500 

D30931 
AF035293 
AA056111 



M26066 
M22348 

AA341061 
H95397 
T24503 

R14219 
W37945 
W47233 

N68247 

AA251423 

T80569 
AI650409 

AA740661 
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3.3 Gene expression NP_005068 

2.2 Metabolism NP_000469 

3.4 Metabolism NP_005130 
4.8 Metabolism NP_005495.1 

2.5 Metabolism 

2.6 Metabolism NP 055720 
2.8 Metabolism NP_006321 
2.5 Metabolism NP 002485 



2.2 Metabolism NP_000280 

2.5 Metabolism NP_006285 

2.4 Unclassified NP_057131 

2.7 Unclassified NP_1 12225 

2.4 Unclassified NP_006295 

2.7 Unclassified NP_009016 

2.8 Unclassified NP_473357 
7 Unclassified NP_1 12201 

2.7 Unclassified 

2.2 Unclassified NP_057702 

2.7 Unclassified NP_009121.1 

2.2 Unclassified NP_065999 

2.5 Unclassified NP 055459 
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Preimplantation protein 3 
Putative nuclear protein 
ORF1-FL49 
Similar to rat nuclear 
ubiquitous casein kinase 2 
Similar to RIKEN 
Spectrin, beta 
Stromal cell-derived factor 
receptor 1 

Thioredoxin-related protein 
Transmembrane 4 
superfamily member 2 
Tumor endothelial marker 8 
Downregulated gene in CAD 
CASP8 and FADD-like 
apoptosis regulator 
CD81 antigen 
Cell division cycle 25B 
DEAD/H (Asp-Glu-Ala- 
Asp/His) box polypeptide 27 
F-box and leucine-rich repeat 
protein 1 1 
Minichromosome 
maintenance deficient 3 
associated protein 
Protein phosphatase 2, 
regulatory subunit A, alpha 
isoform 

Thyroid autoantigen 70 kDa 
A disintegrin and 
metalloproteinase domain 17 
A kinase anchor protein 13 
Calpastatin 



D59087 
W33098 

H09434 

AA297412 
AI334431 
H71558 

AA421549 
D29808 

D79964 



M33680 
M81934 
AA985699 

R98291 

HI 0286 

J02902 



J04607 
R32760 

M90360 
AF037194 



2.5 
2.8 

2.2 

2.5 
2.5 
4.1 
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Unclassified NP_056202 
Unclassified NP 115788 



Unclassified 

Unclassified 
Unclassified 



0.41 
0.4 
0.42 

0.27 

0.43 



Cell cycle 
Cell cycle 
Cell cycle 

Cell cycle 

Cell cycle 



Q9H1E3 

T02670 
Q01082 



Unclassified NP 816929 



2.8 Unclassified NP_1 10437 

2.4 Unclassified NP_004606 

2.5 Unclassified NP 444262 



AF015450 0.45 Cell cycle NP_003870 



NP_004347 
NP_068660 
NP_694705 

NP_036440 

NP 003897 



0.48 Cell cycle NP_055040 



0.25 Cell cycle NP_001460 

0.37 Cell signaling 

0.31 Cell signaling NP_658913 

0.39 Cell signaling NP_006471 
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Diacylglycerol kinase, alpha 
80kDa 

garnma-aminobutyric acid B 
receptor, 1 

Inositol polyphosphate-5- 
phosphatase, 145 kDa 
Lymphocyte-specific protein 
tyrosine kinase 
RAP1B, member of RAS 
oncogene family 
Ras association 
(RalGDS/AF-6) domain 
family 1 

CDC42-effector protein 3 
Leupaxin 
Annexin A6 
RAN-binding protein 9 
Thymosin, beta 10 
GranzymeA 

ThromboxaneA synthase 1 

Coatomer protein complex, 

subunit beta 

i 

Cold-inducible RNA-binding 
protein 

Leucine-rich repeat 
interacting protein 1 
Proteasome subunit, alpha 
type, 3 

Proteasome subunit, alpha 
type, 7 

Protein phosphatase 1G, 
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AF064770 0.44 Cell signaling NPJ301336 

AJ012187 0.42 Cell signaling NP_068705 

U84400 0.41 Cell signaling NP__005532 

X05027 0.45 Cell signaling NP_005347 

P09526 0.4 Cell signaling P09526 

AF061836 0.43 Cell signaling NPJ733835 



AF104857 


0.28 
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M36717 0.44 Gene expression NP_002930 

AF021819 0.3 Gene expression NP_009193 

Ul 603 1 0.45 Gene expression NP_003 144 

M62810 0.41 Gene expression NPJ)36383 

AF01 7306 0.3 1 Gene expression NP_003354 

AA100046 0.46 Metabolism NP_612461 

J03592 0.3 Metabolism NP_001627 

U41635 0.45 Unclassified NP_006803 

C00577 0.45 Unclassified NPJ)09198 

W00460 0.4 Unclassified NPJ)55134 

T95969 0.45 Unclassified NP_055459 

N93789 0.45 Unclassified NP_065138 

AF031588 0.22 Unclassified NP 003378 



EXAMPLE 2 

This example demonstrates the use of the claimed invention to identify biomarkers of 
hyperlipidimea and use of same. As used herein, a "biomarker" is any nucleic acid based 
5 substance that corresponds to, and can specifically identify a RNA transcript. 

As used herein, "hyperlipidemia" is defined as an elevation of lipid protein profiles 
and includes the elevation of chylomicrons, very low-density lipoproteins (VLDL), 
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intermediate-density lipoproteins (DDL), low-density lipoproteins (LDL), and/or high-density 
lipoproteins (HDL) as compared with the general population. Hyperlipidemia includes 
hypercholesterolemia and/or hypertriglyceridemia. By hypercholesterolemia, it is meant 
elevated fasting plasma total cholesterol level of>200mg/dL, and/or LDL-cholesterol levels 
5 of >130mg/dL. A desirable level of HDL-cholesterol is> 60mg/dL. By hypertriglyceridemia 
it is meant plasma triglyceride (TG) concentrations of greater than the 90 th or 95 th percentile 
for age and sex and can include, for example, TG > 160mg/dL as determined after an 
overnight fast. 

The level of one or more RNA transcripts expressed in blood obtained from one or 
more individuals with hyperlipidemia was determined as follows. Whole blood samples were 
taken from patients who were diagnosed with hyperlipidemia as defined herein. In each 
case, the diagnosis of hyperlipidemia was corroborated by a skilled Board certified physician. 
Total mRNA from lysed blood was isolated using TRIzol® reagent (GLBCO). 
Fluorescently labeled probes for each blood sample were generated as described above. Each 
probe was denatured and hybridized to a 15K Chondrogene Micro array Chip 
(ChondroChip™) and/or an Affymetrix GeneChip® microarray as described herein. The 
presence of a fluorescent dye on the microarray indicates hybridization of a target nucleic 
acid and a specific nucleic acid member on the microarray. The intensities of fluorescence 
dye represent the amount of target nucleic acid which is hybridized to the nucleic acid 
member on the microarray, and is indicative of the expression level of the specific nucleic 
acid member sequence in the target sample. 

Those transcripts which display differing levels with respect to the levels of those 
from patients unaffected by hyperlipidemia were identified as being biomarkers for said 
disease of interest. Identification of genes differentially expressed in whole blood samples 
25 from patients with hyperlipidemia as compared to healthy patients was determined by 
statistical analysis using the Wilcox Maim Whitney rank sum test. 

Classification or class prediction of a test sample as either having hypertension and 
OA or being normal can be done using the differentially expressed genes as shown in Table 
1A in combination with well known statistical algorithms for class prediction as would be 
30 understood by a person skilled in the art and described herein. Commercially available 
programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class 
Predication are also available. 
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Figure 13 shows a diagrammatic representation of RNA expression profiles of whole 
blood samples from individuals who were identified as having hyperlipidemia as described 
herein as compared with RNA expression profiles from normal and non-hyperlipidemia 
patients. Expression profiles were generated using GeneSpring™ software analysis as 
5 described herein. Each column represents the hybridization pattern resulting from a single 
individual. Normal individuals have no known medical conditions and were not taking any 
known medication. Non hyperlipidemia individuals presented without elevated cholesterol or 
elevated triglycerides but may have presented with other medical conditions and may be 
under various treatment regimes. 

A dendogram analysis is shown above. Samples are clustered and marked as 
representing patients who have elevated lipids and/or cholesterol, are normal or do not have 
elevated lipids or cholesterol. The "*" indicates those patients who abnormally clustered as 
having either hyperlipidemia, normal or non-hyperlipidemia despite actual presentation. The 
number of hybridizations profiles determined for hyperlipidemia patients, non- 
hyperlipidemia patients and normal individuals are shown. Various experiments were 
performed as outlined above, and analyzed using either the Wilcox Mann Whitney rank sum 
test, or other statistical analysis as described herein and those genes identified with a p value 
of < 0.05 as between the patients with hyperlipidemia as compared with patients without 
hyperlipidemia are shown in Table 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with hyperlipidemia can be done using the differentially expressed 
genes as shown in Table ID in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Predication are also available. 

In addition to Hyperlipidemia, biomarkers for the following diseases were identified 
using the above method steps to identify one or more genetic markers for the following 
diseases; Type II Diabetes, Hypertension, Obesity, Lung Disease, Bladder Cancer, Coronary 
30 Artery Disease, Rheumatoid Arthritis, Depression, Osteoarthritis, Liver Cancer, 

Schizophrenia, Chagas Disease, Asthma, Lung Cancer, Heart Failure, Psoriasis, Thyroid 
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Disorder, Irritable Bowel Syndrome, Osteoporosis, Migraine Headaches, Eczema, NASH, 
Alzheimer's Disease, Manic Depression Syndrom, Crohn's Colitis, Chronic Cholecystits, 
Cervical Cancer, Stomach Cancer, Kidney Cancer, Testicular Cancer, Colon Cancer, 
Hepatitis B, and Pancreatic Cancer. 

5 Diabetes 

This example demonstrates the use of the claimed invention to identify biomarkers of 
diabetes and use of same. 

As used herein, "diabetes", or "diabetes mellitus" includes both "type 1 diabetes" 

10 (insulin-dependent diabetes (IDDM)) and "type 2 diabetes" (insulin-independent diabetes 
(NIDDM). Both type 1 and type 2 diabetes characterized in accordance with Harrison's 
Principles of Internal Medicine 14th edition, as a person having a venous plasma glucose 
concentration > 140mg/dL on at least two separate occasions after overnight fasting and 
venous plasma glucose concentration > 200mg/dL at 2 h and on at least one other occasion 

15 during the 2-h test following ingestion of 75g of glucose. Patients identified as having type 2. 
diabetes as described herein are those demonstrating insulin-independent diabetes as 
determined by the methods described above. Whole blood samples were taken from patients 
who were diagnosed with type 2 diabetes as defined herein. In each case, the diagnosis of 
type 2 diabetes was corroborated by a skilled Board certified physician. Figure 12 shows a 

20 diagrammatic representation of RNA expression profilesRNA expression profiles of Whole 
blood samples from individuals who were identified as having type 2 diabetes as described 
herein as compared RNA expression profilesRNA expression profiles from individuals not 
having type 2 diabetes. RNA expression profilesRNA expression profiles were generated 
using GeneSpring™ software analysis as described herein. Hybridizations to create said 

25 RNA expression profilesRNA expression profiles were done using the 15K Chondrogene 
Microarray Chip (Chondro Chip™) as described herein. Samples are clustered and marked 
as representing patients who have type 2 diabetes or control individuals. The number of 
hybridizations profiles determined for patients with type 2 diabetes or who are controls are 
shown. Various experiments were performed as outlined above, and analyzed using either 

30 the Wilcox Mann Whitney rank sum test, or other statistical tests as described herein, and 

those genes identified with a p value of < 0.05 as between the patients with type 2 diabetes as 
compared with patients without type 2 diabetes are shown in Table IP. 
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Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with type 2 diabetes can be done using the differentially expressed 
genes as shown in Table IP in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
5 Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Predication are also available. 

RNA expression profllesRNA expression profllesLung Disease 

This example demonstrates the use of the claimed invention to identify 
L0 biomarkers of Lung Disease and use of same. 

As used herein, "lung disease" encompasses any disease that affects the respiratory 
system and includes bronchitis, chronic obstructive lung disease, emphysema, asthma, and 
lung cancer. Patients identified as having lung disease includes patients having one or more 
of the above noted conditions. In each case, the diagnosis of lung disease was corroborated 

15 by a skilled Board certified physician. Figure 14 shows a diagrammatic representation of 
RNA expression profllesRNA expression profiles of Whole blood samples from individuals 
who were identified as having lung disease as described herein as compared with RNA 
expression profllesRNA expression profiles from normal and non lung disease individuals. 
Samples are clustered and marked as representing patients who have lung disease, are normal 

20 or do not have lung disease. The "*" indicates those patients who abnormally clustered 

despite actual presentation. The number of hybridizations profiles determined for either the 
lung disease patients, non-lung disease patients and nonnal individuals are show. Various 
experiments were performed as outlined above, and analyzed using the Wilcox Mann 
Whitney rank sum test, or other statistical analysis as described herein and those genes 

25 identified with a p value of < 0.05 as between the patients with lung disease as compared with 
patients without lung disease are shown in Table 1R. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with lung disease can be done using the differentially expressed 
genes as shown in Table 1R in combination with well known statistical algorithms for class 
30 prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Predication are also available. 
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Bladder Cancer 

This example demonstrates the use of the claimed invention to identify 
biomarkers of bladder cancer and use of same. 

5 As used herein, "bladder cancer" includes carcinomas that occur in the transitional 

epithelium lining the urinary tract, starting at the renal pelvis and extending through the 
ureter, the urinary bladder, and the proximal two-thirds of the urethra. As used herein, 
patients diagnosed with bladder cancer include patients diagnosed utilizing any of the 
following methods or a combination thereof: urinary cytologic evaluation, endoscopic 
0 evaluation for the presence of malignant cells, CT (computed tomography), MRI (magnetic 
resonance imaging) for metastasis status. In each case, the diagnosis of bladder cancer was 
corroborated by a skilled Board certified physician. Figure 15 shows a diagrammatic 
representation of RNA expression profilesRNA expression profiles of Whole blood samples 
from individuals who were identified as having bladder cancer as described herein as 

15 compared with RNA expression profilesRNA expression profiles from non bladder cancer 
individuals. Expression profiles were generated using GeneSpring™ software analysis as 
described herein. Each column represents the hybridization pattern resulting from a single 
individual. Non bladder cancer individuals presented without bladder cancer, but may have 
presented with other medical conditions and may be under various treatment regimes. 

20 Hybridizations to create said RNA expression profilesRNA expression profiles were done 
using the Affymetrix U133A chip. A dendogram analysis is shown above. Samples are 
clustered and marked as representing patients who have bladder cancer, or do not have 
bladder cancer. The indicates those patients who abnormally clustered as either bladder 
cancer, or non bladder cancer despite actual presentation. The number of hybridizations 

25 profiles determined for patients with bladder cancer and without bladder cancer to create said 
Figure are shown. Various experiments were performed as outlined above, and analyzed 
using either the Wilcox Mann Whitney rank sum test, or other statistical tests as described 
herein and those genes identified with a p value of < 0.05 as between the patients with 
bladder cancer as compared with patients without bladder cancer are shown in Tables IS. 

30 Classification or class prediction of a test sample from an unknown patient in order to 

diagnose said individual with bladder cancer can be done using the differentially expressed 
genes as shown in Table IS in combination with well known statistical algorithms for class 
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prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Predication are also available. 

5 Coronary Artery Disease 

This example demonstrates the use of the claimed invention to identify 
biomarkers of coronary artery disease and use of same. 

As used herein, "Coronary artery disease" (CAD) is defined as a condition wherein at 
least one coronary artery has >50% luminal diameter stenosis, as diagnosed by coronary 

10 angiography and includes conditions in which there is atheromatous narrowing and 
subsequent occlusion of the vessel. CAD includes those conditions which manifest as 
angina, silent ischaemia, unstable angina, myocardial infarction, arrhythmias, heart failure, 
and sudden death. Patients identified as having CAD includes patients having one or more 
of the above noted conditions. In each case, the diagnosis of Coronary artery disease was 

15 corroborated by a skilled Board certified physician. Figure 17 shows a diagrammatic 

representation of RNA expression profilesRNA expression profiles of Whole blood samples 
from individuals who were identified as having coronary artery disease (CAD) as described 
herein as compared with RNA expression profilesRNA expression profiles from non- 
coronary artery disease individuals. Expression profiles were generated using GeneSpring™ 

20 software analysis as described herein. Each column represents the hybridization pattern 

resulting from a single individual. Non coronary artery disease individuals presented without 
coronary artery disease, but may have presented with other medical conditions and may be 
under various treatment regimes. Hybridizations to create said RNA expression profilesRNA 
expression profiles were done using the Affymetrix U133A chip. A dendogram analysis is 

25 shown above. Samples are clustered and marked as representing patients who have coronary 
artery disease or do not have coronary artery disease. The indicates those patients who 
abnormally clustered despite actual presentation. The number of hybridizations profiles 
determined for patients with CAD or without CAD are shown. Various experiments were 
performed as outlined above, and analyzed using either the Wilcox Mann Whitney rank sum 

30 test, or other statistical analysis as described herein and those genes identified with a p value 
of < 0.05 as between the patients with coronary artery disease as compared with patients 
without coronary artery disease are shown in Table 1U. 

78 



WO 2004/112589 PCT/US2004/020836 
Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with CAD can be done using the differentially expressed genes as 
shown in Table 1U in combination with well known statistical algorithms for class prediction 
as would be understood by a person skilled in the art and is described herein. Commercially 
5 available programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class 
Predication are also available. 



Rheumatoid Arthritis 

This example demonstrates the use of the claimed invention to identify 
10 biomarkers of rheumatoid arthritis and use of same. 

As used herein "Rheumatoid Arthritis" (RA) is defined as a chronic, multisystem 
disease of unknown etiology with the characteristic feature of persistent inflammatory 
synovitis. Said inflammatory synovitis usually involves peripheral joints in a systemic 
distribution. Patients having RA as defined herein were identified as having one or more of 

15 the following; (i)cartilage destruction, (ii)bone erosions and/or (iii) joint deformities. Whole 
blood samples were taken from patients who were diagnosed Rheumatoid arthritis as defined 
herein. In each case, the diagnosis of Rheumatoid arthritis was corroborated by a skilled 
Board certified physician. Figure 18 shows a diagrammatic representation of RNA expression 
profilesRNA expression profiles of Whole blood samples from individuals who were 

20 identified as having rheumatoid arthritis as described herein as compared with RNA 
expression profilesRNA expression profiles from non-rheumatoid arthritis individuals. 
Expression profiles were generated using GeneSpring™ software analysis as described 
herein. Each column represents the hybridization pattern resulting from a single individual. 
Normal individuals have no known medical conditions and were not taking any known 

25 medication. Non rheumatoid arthritis individuals presented without rheumatoid arthritis, but 
may have presented with other medical conditions and may be under various treatment 
regimes. Hybridizations to create said RNA expression profilesRNA expression profiles 
were done using ChondroChip™ and Affymetrix U133A Chip. A dendogram analysis using 
the ChondroChip is shown above. Samples are clustered and marked as representing patients 

30 who have rheumatoid arthritis or do not have rheumatoid arthritis. The indicates those 
patients who abnormally clustered despite actual presentation. The number of hybridizations 
profiles determined for patients with rheumatoid arthritis and without rheumatoid arthritis are 
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shown. Various experiments were performed as outlined above and analyzed using either the 
Wilcox Mann Whitney rank sum test, or other statistical tests as described herein and those 
genes identified with a p value of < 0.05 as between the patients with rheumatoid arthritis as 
compared with patients without rheumatoid arthritis are shown. Data generated using the 
5 ChondroChip™ array is shown in Table IV whereas data generated using the Affymetrix 
U133A Chip is shown in Table 1 W. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with rheumatoide arthritis can be done using, the differentially 
expressed genes as shown in Table IV and 1 W in combination with well known statistical 
10 algorithms for class prediction as would be understood by a person skilled in the art and is 
described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Predication are also available. 



15 Depression 

This example demonstrates the use of the claimed invention to identify biomarkers of 
depression and use of same. 

As used herein "depression" includes depressive disorders or depression in 
association with medical illness or substance abuse in addition to depression as a result of 

20 sociological situations. Patients defined as having depression were diagnosed mainly on the 
basis of clinical symptoms including a depressed mood episode wherein a person displays a 
depressed mood on a daily basis for a period of greater than 2 weeks. A depressed mood 
episode may be characterized by sadness, indifference, apathy, or irritability and is usually 
associated with changes in a number of neurovegetative functions, including sleep patterns, 

25 appetite and weight, fatigue, impairment in concentration and decision making. Whole blood 
samples were taken from patients who were diagnosed with depression as defined herein. In 
each case, the diagnosis of depression was corroborated by a skilled Board certified 
physician. Figure 19 shows a diagrammatic representation of RNA expression profilesRNA 
expression profiles of Whole blood samples from individuals who were identified as having 

30 depression as described herein as compared with RNA expression profilesRNA expression 
profiles from non-depression individuals. Expression profiles were generated using 
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GeneSpring™ software analysis as described herein. Each column represents the 
hybridization pattern resulting from a single individual. Normal individuals have no known 
medical conditions and were not taking any known medication. Non depression individuals 
presented without depression, but may have presented with other medical conditions and may 
be under various treatment regimes. Hybridizations to create said RNA expression 
profilesRNA expression profiles were done using ChondroChip™. A dendogram analysis is 
shown above. Samples are clustered and marked as representing patients who have 
depression, having non-depression or normal. The indicates those patients who 
abnormally clustered despite actual presentation. The number of hybridizations profiles 
determined for patients with depression, non-depression and normal are shown. Various 
experiments were performed as outlined above, and analyzed using either the Wilcox Mann 
Whitney rank sum test, or other statistical tests as described herein and those genes identified 
with a p value of < 0.05 as between the patients with depression as compared with patients 
without depression are shown in Table IX. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with depression can be done using the differentially expressed genes 
as shown in Table IX in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Predication are also available. 



Osteoarthritis 

This example demonstrates the use of the claimed invention to identify 
biomarkers which differentiate various stages of Osteoarthritis and use of same. 

"Osteoarthritis" (OA), as used herein also known as "degenerative joint disease", represents 
failure of a diarthrodial (movable, syno vial-lined) joint. It is a condition, which affects joint 
cartilage, and or subsequently underlying bone and supporting tissues leading to pain, 
stiffness, movement problems and activity limitations. It most often affects the hip, knee, 
foot, and hand, but can affect other joints as well. OA severity can be graded according to the 
system described by Marshall (Marshall KW. J Rheumatol, 1996:23(4) 582-85). Briefly, 
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each of the six knee articular surfaces was assigned a cartilage grade with points based on the 
worst lesion seen on each particular surface. Grade 0 is normal (0 points), Grade I cartilage 
is soft or swollen but the articular surface is intact (1 point). In Grade II lesions, the cartilage 
surface is not intact but the lesion does not extend down to subchondral bone (2 points). 
Grade III damage extends to subchondral bone but the bone is neither eroded nor eburnated 
(3 points). In Grade IV lesions, there is eburnation of or erosion into bone (4 points). A 
global OA score is calculated by summing the points from all six cartilage surfaces. If there 
is any associated pathology, such as meniscus tear, an extra point will be added to the global 
score. Based on the total score, each patient is then categorized into one of four OA groups: 

3 mild (1-6), moderate (7-12), marked (13-18), and severe (>1 8). As used herein, patients 
identified with OA may be categorized in any of the four OA groupings as described above. 
Whole blood samples were taken from patients who were diagnosed with osteoarthritis and a 
specific stage of osteoarthritis as defined herein. In each case, the diagnosis of 
osteoarthritis and the stage of osteoarthritis was corroborated by a skilled Board certified 

5 physician . Figure 20 shows a diagrammatic representation of RNA expression profilesRNA 
expression profiles of Whole blood samples from individuals having various stages of 
osteoarthritis as compared with RNA expression profilesRNA expression profiles from 
normal individuals. Expression profiles were generated using GeneSpring™ software 
analysis as described herein. Each column represents the hybridization pattern resulting from 
10 a single individual. Normal individuals have no known medical conditions and were not 
taking any known medication. Hybridizations to create said RNA expression profilesRNA 
expression profiles were done using the ChondroChip™. A dendogram analysis is shown 
above. Samples are clustered and marked as representing patients who presented with 
different stages of osteoarthritis or normal. The indicates those patients who abnormally 
25 clustered despite actual presentation. The number of hybridizations profiles determined for 
either osteoarthritis patients or normal individuals are shown in Figure 20. Statistical analysis 
was done using an ANOVA test and those genes identified with a p value of < 0.05 in 
pairwise comparisons between patients with mild, moderate, marked, severe or no 
osteoarthritis as shown in Table 1 Y. 

30 

Liver Cancer 



82 



WO 2004/112589 PCT/US2004/020836 
This example demonstrates the use of the claimed invention to identify biomarkers of 
liver cancer and use of same. 

As used herein, "liver cancer" means primary liver cancer wherein the cancer initiates 
in the liver. Primary liver cancer includes both hepatomas or hepatocellular carcinomas 
5 (HCC) which start in the liver and chonalgiomas where cancers develop in the bile ducts of 
the liver. Whole blood samples were taken from patients who were diagnosed with liver 
cancer as defined herein. In each case, the diagnosis of liver cancer was corroborated by a 
skilled Board certified physician. Figure 21 shows a diagrammatic representation of RNA 
expression profilesRNA expression profiles of Whole blood samples from individuals who 

10 were identified as having liver cancer as described herein as compared with RNA expression 
profilesRNA expression profiles from non-liver cancer disease individuals. Expression 
profiles were generated using GeneSpring™ software analysis as described herein. Each 
column represents the hybridization pattern resulting from a single individual. Control 
samples presented without liver cancer but may have presented with other medical conditions 

15 and may be under various treatment regimes. Hybridizations to create said RNA expression 
profilesRNA expression profiles were done using the Affymetrix® U133A chip. A 
dendogram analysis is shown above. Samples are clustered and marked as representing 
patients who have liver cancer or control. The number of hybridizations profiles determined 
for patients with liver cancer or who are controls are shown. Various experiments were 

20 performed as outlined above, and analyzed using either the Wilcox Mann Whitney rank sum 
test, or other statistical tests as described herein, and those genes identified with a p value of 
< 0.05 as between the patients with liver cancer as compared with patients without liver 
cancer are shown in Table 1Z. 

Classification or class prediction of a test sample from an unknown patient in order to 
25 diagnose said individual with liver cancer can be done using the differentially expressed 
genes as shown in Table 1Z in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 

30 

Schizophrenia 
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This example demonstrates the use of the claimed invention to identify biomarkers of 
diabetes and use of same. 

As used herein, "schizophrenia" is defined as a psychotic disorders characterized by 
distortions of reality and disturbances of thought and language and withdrawal from social 

5 contact. Patients diagnosed with "schizophrenia" can include patients having any of the 
following diagnosis: an acute schizophrenic episode, borderline schizophrenia, catatonia, 
catatonic schizophrenia, catatonic type schizophrenia, disorganized schizophrenia, 
disorganized type schizophrenia, hebephrenia, hebephrenic schizophrenia, latent 
schizophrenia, paranoic type schizophrenia, paranoid schizophrenia, paraphrenia, paraphrenic 

10 schizophrenia, psychosis, reactive schizophrenia or the like. Whole blood samples were taken 
from patients who were diagnosed with schizophrenia as defined herein. In each case, the 
diagnosis of schizophrenia was corroborated by a skilled Board certified physician. Figure 
22 shows a diagrammatic representation of RNA expression profilesRNA expression profiles 
of Whole blood samples from individuals who were identified as having schizophrenia as 

15 described herein as compared with RNA expression profilesRNA expression profiles from 
non schizophrenic individuals. Expression profiles were generated using GeneSpring™ 
software analysis as described herein. Each column represents the hybridization pattern 
resulting from a single individual. Control samples presented without schizophrenia but may 
have presented with other medical conditions and may be under various treatment regimes. 

20 Hybridizations to create said RNA expression profilesRNA expression profiles were done 
using the Affymetrix® U133A chip. A dendogram analysis is shown above. Samples are 
clustered and marked as representing patients who have schizophrenia or control individuals. 
The number of hybridizations profiles determined for patients with schizophrenia or who are 
controls are shown. Various experiments were performed as outlined above, and analyzed 

15 using either the Wilcox Mann Whitney rank sum test, or other statistical tests as described 
herein, and those genes identified with a p value of < 0.05 as between the patients with 
schizophrenia as compared with patients without schizophrenia are shown in Table 1 AA. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with schizophrenia can be done using the differentially expressed 
10 genes as shown in Table 1 AA in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 
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Chagas disease 

This example demonstrates the use of the claimed invention to identify 
biomarkers of Chagas' disease and use of same. 

5 As used herein, "Chagas' disease" is defined as a condition wherein an individual is 

infected with the protozoan parasite Trypanosoma cruzi and includes both acute and chronic 
infection. Acute infection with T. cruzi can be diagnosed by detection of parasites by either 
microscopic examination of fresh anticoagulated blood or the buffy coat, giemsa-stained thin 
and thick blood smears and/or mouse inoculation and culturing of the blood of a potentially 

10 infected individual. Even in the absence of a positive result from the above, an accurate 

determination of infection can be made by xenodiagnosis wherein reduviid bugs are allowed 
to feed on the patient's blood and subsequently the bugs are examined for infection. Chronic 
infection can be determined by detection of antibodies specific to the T. cruzi antigens and/or 
immunoprecipitation and electrophoresis of the T. cruzi antigens. 

L5 As used herein "Symptomatic Chagas disease" includes symptomatic acute chagas 

and symptomatic chronic chagas disease. Acute symptomatic chagas disease can be 
characterized by one or more of the following: area of erythema and swelling (a chagoma); 
local lymphadenopathy; generalized lymphadenopathy; mild hepatosplenomegaly; unilateral 
painless edema of the palpebrae and periocular tissues; malaise; fever; anorexia and/or edema 

to of the face and lower extremities. Symptomatic chronic Chagas' disease include one or more 
of the following symptoms: heart rhythm disturbances, cardiomyopathy, thromboembolism, 
electrocardiographic abnormalities including right bundle-branch blockage; atrioventricular 
block; premature ventricular contractions and tachy- and bradyarrhythmias; dysphagia; 
odynophagia, chest pain; regurgitation; weight loss, cachexia and pulmonary infections. 

5 As used herein "Asymptomatic Chagas disease" is meant to refer to individuals who 

are infected with T. cruzi but who do not show either acute or chronic symptoms of the 
disease. 

Whole blood samples were taken from patients who were diagnosed symptomatic or 
asymptomatic Chagas disease as defined herein. In each case, the diagnosis of Chagas 
D disease was corroborated by a qualified physician. Figure 23 shows a diagrammatic 

representation of RNA expression profilesRNA expression profiles of Whole blood samples 
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from individuals who were identified as having symptomatic Chagas disease; asymptomatic 
Chagas disease or who were control individuals as described herein as compared with RNA 
expression profilesRNA expression profiles from individuals not having Chagas Disease. 
Expression profiles were generated using GeneSpring™ software analysis as described 
5 herein. Each column represents the hybridization pattern resulting from a single individual. 
Control samples presented without Chagas disease but may have presented with other 
medical conditions and may be under various treatment regimes. Hybridizations to create 
said RNA expression profilesRNA expression profiles were done using the Affymetrix® 
U133A chip. A dendogram analysis is shown above. Samples are clustered and marked as 

10 representing patients who have symptomatic chagas disease; asymptomatic chagas disease or 
control. The number of hybridizations profiles determined for patients with chagas disease; 
asymptomatic chagas disease or who are controls are shown. Various experiments were 
performed as outlined above, and analyzed using either the Wilcox Mann Whitney rank sum 
test, or other statistical tests as described herein. Those genes identified with a p value of < 

15 0.05 as between the patients with Chagas disease as compared with patients without Chagas 
disease are shown in Table 1 AB. Those genes identified with a p value of < 0.05 as between 
the patients with Asymptomatic Chagas disease as compared with patients without Chagas 
disease are shown in Table 5T. Those genes identified with a p value of < 0.05 as between 
the patients with Symptomatic Chagas disease as compared with patients without Chagas 

20 disease are shown in Table 5U. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with symptomatic chagas disease can be done using the 
differentially expressed genes as shown in Table 5U in combination with well known 
statistical algorithms for class prediction as would be understood by a person skilled in the art 
25 and is described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. Classification or class 
prediction of a test sample from an unknown patient in order to diagnose said individual with 
asymptomatic chagas disease can be done using the differentially expressed genes as shown 
in Table 5T. 

30 Asthma 

This example demonstrates the use of the claimed invention to identify biomarkers of 
asthma disease and use of same. 
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As used herein, "asthma" indicates a chronic disease of the airways in the lungs 
characterized by constriction (the tightening of the muscles surrounding the airways) and 
inflammation (the swelling and irritation of the airways). Together constriction and 
inflammation cause narrowing of the airways, which results in symptoms such as wheezing, 
coughing, chest tightness, and shortness of breath. Whole blood samples were taken from 
patients who were diagnosed with asthma as defined herein. In each case, the diagnosis of 
asthma was corroborated by a skilled Board certified physician. Expression profiles were 
generated using GeneSpring™ software analysis as described herein. Each column 
represents the hybridization pattern resulting from a single individual. Hybridizations to 
create said RNA expression profilesRNA expression profiles were done using the 
ChondroChip™ and the Affymetrix Chip. Samples are clustered and marked as representing 
patients who have asthma or control individuals. The number of hybridizations profiles 
determined for patients with asthma and controls are shown. Various experiments were 
performed as outlined above, and analyzed using either the Wilcox Mann Whitney rank sum 
test, or other statistical tests as described herein. Those genes identified with a p value of < 
0.05 as between the patients with asthma as compared with patients without asthma using the 
ChondroChip™ are shown in Table IAD. Those genes identified with a p value of < 0.05 as 
between the patients with asthma as compared with patients without asthma using the 
Affymetrix® platform are shown in Table 1 AE. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with asthma can be done using the differentially expressed genes as 
shown in Table IAD and Table 1AE in combination with well known statistical algorithms 
for class prediction as would be understood by a person skilled in the art and is described 
herein. Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 

Hypertension 

This example demonstrates the use of the claimed invention to identify biomarkers of 
hypertension and use of same. 

As used herein, "hypertension" is defined as high blood pressure or elevated arterial pressure. 
Patients identified with hypertension herein include persons who have an increased risk of 
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developing a morbid cardiovascular event and/or persons who benefit from medical therapy 
designed to treat hypertension. Patients identified with hypertension also can include persons 
having systolic blood pressure of >130 mm Hg or a diastolic blood pressure of >90 mm Hg or 
a person takes antihypertensive medication. Whole blood samples were taken from patients 
5 who were diagnosed with hypertension as defined herein. In each case, the diagnosis of 
hypertension was corroborated by a skilled Board certified physician. Figure 5 shows a 
diagrammatic representation of RNA expression profilesRNA expression profiles of Whole 
blood samples from individuals who were identified as having hypertension as described 
herein as compared with RNA expression profilesRNA expression profiles from non 

10 hypertensive individuals and normal individuals. Expression profiles were generated using 
GeneSpring™ software analysis as described herein. Each column represents the 
hybridization pattern resulting from a single individual. Non hypertensive individuals 
presented without hypertension but may have presented with other medical conditions and 
may be under various treatment regimes. Normal individuals presented without any known 

15 conditions. Hybridizations to create said RNA expression profilesRNA expression profiles 
were done using a 1 5K Chondrogene Microarray Chip (ChondroChip™) as described herein. 
Samples are clustered and marked as representing patients who have hypertension or control 
individuals. The number of hybridizations profiles determined for patients with 
hypertension, without hypertension or who are controls are shown in Figure 5. Various 

20 experiments were performed as outlined above, and analyzed using either the Wilcox Mann 
Whitney rank sum test, or other statistical tests as described herein, and those genes identified 
with a p value of < 0.05 as between the patients with hypertension as compared with patients 
without hypertension are shown in Table IE. Table 1 AG shows those genes identified with a 
p value of < 0.05 as between the patients with hypertension as compared with patients 

25 without hypertension from gene expressions profiles generated by analogous experiments 
using the Affynetrix® GeneChip®. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with hypertension can be done using the differentially expressed 
genes as shown in Table IE and 1 AG in combination with well known statistical algorithms 
30 for class prediction as would be understood by a person skilled in the art and is described 
herein. Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 
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Obesity 

This example demonstrates the use of the claimed invention to identify biomarkers of 
obesity and use of same. 

As used herein, "obesity" is defined as an excess of adipose tissue that imparts a 
health risk. Obesity is assessed in terms of height and weight in the relevance of age. 
Patients who are considered obese include, but are not limited to, patients having a body mass 
index or BMI ((defined as body weight in kg divided by (height in meters) 2 ) greater than or 
equal to 30.0. Patients having obesity as defined herein are those with a BMI of greater than 
or equal to 30.0. Whole blood samples were taken from patients who were diagnosed with 
obesity as defined herein. In each case, the diagnosis of obesity was corroborated by a 
skilled Board certified physician. Figure 6 shows a diagrammatic representation of RNA 
expression profilesRNA expression profiles of Whole blood samples from individuals who 
were identified as having obesity as described herein as compared with RNA expression 
profilesRNA expression profiles from non obese individuals. RNA expression profilesRNA 
expression profiles were generated using GeneSpring™ software analysis as described 
herein. Hybridizations to create said RNA expression profilesRNA expression profiles 
were done using the 15K Chondrogene Microarray Chip (ChondroChip™) as described 
herein. Samples are clustered and marked as representing patients who have obesity, those 
who are not obese, and normal individuals. The number of hybridizations profiles 
determined for patients with obesity, were not obese, and normal individuals are shown. 
Various experiments were performed as outlined above, and analyzed using either the Wilcox 
Mann Whitney rank sum test, or other statistical tests as described herein, and those genes 
identified with a p value of < 0.05 as between the patients with obesity as compared with 
patients without obesity are shown in Table IF. Table 1 AH shows those genes identified 
with a p value of < 0.05 as between the patients with obesity as compared with patients 
without obesity from gene expressions profiles generated by analogous experiments using the 
Affymetrix® GeneChip® platforms (U133A and U133 Plus 2.0) . 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with obesity can be done using the differentially expressed genes as 
shown in Table IF in combination with well known statistical algorithms for class prediction 
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as would be understood by a person skilled in the art and is described herein. Commercially 
available programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class 
Prediction are also available. 



Psoriasis 

This example demonstrates the use of the claimed invention to identify biomarkers of 
psoriasis and use of same. 

As used herein, "psoriasis" is defined as a common multifactorial inherited condition 
characterized by the eruption of circumscribed, discrete and confluent, reddish, silvery-scaled 
maculopapules; the lesions occur predominantly on the elbows, knees, scalp, and trunk, and 
microscopically show characteristic parakeratosis and elongation of rete ridges with 
shortening of epidermal keratinocyte transit time due to decreased cyclic guanosine 
monophosphate, according to Stedman's Online Medical Dictionary, 27th Edition. Whole 
blood samples were taken from patients who were diagnosed with psoriasis as defined herein. 
In each case, the diagnosis of psoriasis was corroborated by a skilled Board certified 
physician. RNA expression profilesRNA expression profiles of Whole blood samples from 
individuals who were identified as having psoriasis as opposed to not having psoriasis as 
described herein were generated using GeneSpring™ software analysis as described herein. 
Hybridizations to create said RNA expression profilesRNA expression profiles were done 
using the Affymetrix® GeneChip® platforms (U133A and U133 Plus 2.0) as described 
herein (data not shown). . Various experiments were performed as outlined above, and 
analyzed using either the Wilcox Mann Whitney rank sum test, or other statistical tests as 
described herein, and those genes identified with a p value of < 0.05 as between the patients 
with psoriasis as compared with patients without psoriasis are shown in Table 5A. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with psoriasis can be done using the differentially expressed genes 
as shown in Table 5 A in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 
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Thyroid Disorder 

This example demonstrates the use of the claimed invention to identify biomarkers of 
5 thyroid disorder and use of same. 

As used herein, "thyroid disorder" is defined as an overproduction of thyroid hormone 
(hyperthyroidism) , underproduction of thyroid hormone (hypothyroidism), benign 
(noncancerous) thyroid disease, and thyroid cancer. Thyroid disorders include Anaplastic 
carcinoma of the thyroid, Chronis thyroiditis (Hashimoto's disease), colloid nodular goiter, 

10 hyperthyroidism, hyperpituitarism, hypothyridism-primary, hypothyridism-secondary, 

medullary thyroid carcinoma, painless (silent) thyroiditis, papillary carcinoa of the thyroid, 
subacute thyroiditis, thyroid cancer and congenital goiter, according to MEDLINE plus 
Illustrated Medical Encyclopedia . Whole blood samples were taken from patients who were 
diagnosed with a thyroid disorder as defined herein. In each case, the diagnosis of a thyroid 

15 disorder was corroborated by a skilled Board certified physician. RNA expression 
profilesRNA expression profiles of Whole blood samples from individuals who were 
identified as having a thyroid disorder as described herein were generated using 
GeneSpring™ software analysis as described herein. Hybridizations to create said RNA 
expression profilesRNA expression profiles were done using the Affymetrix® GeneChip® 

20 platforms (Ul 33 A and Ul 33 Plus 2.0) as described herein (data not shown). Various 

experiments were performed as outlined above, and analyzed using either the Wilcox Mann 
Whitney rank sum test, or other statistical tests as described herein, and those genes identified 
with a p value of < 0.05 as between the patients with a thyroid disorder as compared with 
patients without a thyroid disorder are shown in Table 5B. 

25 Classification or class prediction of a test sample from an unknown patient in order to 

diagnose said individual with thyroid disorder can be done using the differentially expressed 
genes as shown in Table 5B in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 

30 GeneSpring™) for Class Prediction are also available. 
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Irritable Bowel Syndrome 

This example demonstrates the use of the claimed invention to identify biomarkers of 
irritable bowel syndrome and use of same. 

As used herein, "irritable bowel syndrome" is defined as a common gastrointestinal disorder 

5 involving an abnormal condition of gut contractions (motility) characterized by abdominal 
pain, bloating, mucous in stools, and irregular bowel habits with alternating diarrhea and 
constipation, symptoms that tend to be chronic and to wax and wane over the years, 
according to MedicineNet, Inc., an online, healthcare media publishing company. Whole 
blood samples were taken from patients who were diagnosed with irratble bowel syndrome as 

10 defined herein. In each case, the diagnosis of irratble bowel syndrome was corroborated by 
a skilled Board certified physician. RNA expression profilesRNA expression profiles of 
Whole blood samples from individuals who were generated using GeneSpring™ software 
analysis as described herein. Hybridizations to create said RNA expression profilesRNA 
expression profiles were done using the Affymetrix® GeneChip® platforms (U133A and 

15 U133 Plus 2.0) as described herein (data not shown).. Samples are clustered and marked as 
representing patients who have irratble bowel syndrome or control individuals. Various 
experiments were performed as outlined above, and analyzed using either the Wilcox Mann 
Whitney rank sum test, or other statistical tests as described herein, and those genes identified 
with a p value of < 0.05 as between the patients with irritable bowel syndrome as compared 

20 with patients without irratble bowel syndrome are shown in Table 5C. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with irritable bowel syndrome can be done using the differentially 
expressed genes as shown in Table 5C in combination with well known statistical algorithms 
for class prediction as would be understood by a person skilled in the art and is described 
25 herein. Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 

Osteoporosis 

30 This example demonstrates the use of the claimed invention to identify biomarkers of 

osteoporosis and use of same. 
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As used herein, "osteoporosis" is defined as a reduction in the quantity of bone or atrophy of 
skeletal tissue; an age-related disorder characterized by decreased bone mass and increased 
susceptibility to fractures, according to Stedman's Online Medical Dictionary, 27th Edition. 
Whole blood samples were taken from patients who were diagnosed with osteoporosis 
5 syndrome as defined herein. In each case, the diagnosis of osteoporosis was corroborated by 
a skilled Board certified physician. RNA expression profilesRNA expression profiles of 
Whole blood samples from individuals who were identified as having osteoporosis as 
described herein as compared with RNA expression profilesRNA expression profiles from 
individuals not having osteoporosis, were generated using GeneSpring™ software analysis 

10 as described herein. Hybridizations to create said RNA expression profilesRNA expression 
profiles were done using the Affymetrix® GeneChip® platforms (U133A and U133 Plus 
2.0) as described herein (data not shown). Samples are clustered and marked as 
representing patients who have osteoporosis or control individuals. Various experiments 
were performed as outlined above, and analyzed using either the Wilcox Mann Whitney rank 

15 sum test or other statistical tests as described herein, and those genes identified with a p value 
of < 0.05 as between the patients with osteoporosis as compared with patients without 
osteoporosis are shown in Table 5D, 

Migraine Headaches 

20 This example demonstrates the use of the claimed invention to identify biomarkers of 

migraine headaches and use of same. 

As used herein, "Migraine Headaches" is defined as a symptom complex occurring 
periodically and characterized by pain in the head (usually unilateral), vertigo, nausea and 
vomiting, photophobia, and scintillating appearances of light. Classified as classic migraine, 

25 common migraine, cluster headache, hemiplegic migraine, ophthalmoplegic migraine, and 
ophthalmic migraine, according to Stedman's Online Medical Dictionary, 27th Edition. 
Whole blood samples were taken from patients who were diagnosed with migraine headaches 
as defined herein. In each case, the diagnosis of migraine headaches was corroborated by a 
skilled Board certified physician. RNA expression profiles of Whole blood samples from 

30 individuals who were identified as having migraine headaches as described herein as 

compared with RNA expression profiles from individuals not having migraine headaches, 
were generated using GeneSpring™ software analysis as described herein. Hybridizations to 
create said RNA expression profiles were done using the Affymetrix® GeneChip® platforms 
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(U133A and U133 Plus 2.0) as described herein (data not shown). Samples are clustered and 
marked as representing patients who have migraine headaches or control individuals. 
Various experiments were performed as outlined above, and analyzed using either the Wilcox 
Mann Whitney rank sum test or other statistical tests as described herein, and those genes 
5 identified with a p value of < 0.05 as between the patients with migraine headaches as 
compared with patients without migraine headaches are shown in Table 5E. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with migraine headaches can be done using the differentially 
expressed genes as shown in Table 5E in combination with well known statistical algorithms 
10 for class prediction as would be understood by a person skilled in the art and is described 
herein. Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 

15 Eczema 

This example demonstrates the use of the claimed invention to identify biomarkers of 
eczema and use of same. 

As used herein, "Eczema" is defined as inflammatory conditions of the skin, particularly with 
vesiculation in the acute stage, typically erythematous, edematous, papular, and crusting; 

20 followed often by lichenification and scaling and occasionally by duskiness of the erythema 
and, infrequently, hyperpigmentation; often accompanied by sensations of itching and 
burning; the vesicles form by intraepidermal spongiosis; often hereditary and associated with 
allergic rhinitis and asthma, according to Stedman's Online Medical Dictionary, 27th Edition. 
Whole blood samples were taken from patients who were diagnosed with eczema as defined 

25 herein. In each case, the diagnosis of eczema headaches was corroborated by a skilled 
Board certified physician. RNA expression profiles of Whole blood samples from 
individuals who were identified as having eczema as described herein as compared with 
RNA expression profiles from individuals not having eczema, were generated using 
GeneSpring™ software analysis as described herein. Hybridizations to create said RNA 

30 expression profiles were done using Affymetrix® GeneChip® platforms (U133A and U133 
Plus 2.0) as described herein (data not shown). Samples are clustered and marked as 
representing patients who have eczema or control individuals. Various experiments were 
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performed as outlined above, and analyzed using either the Wilcox Mann Whitney rank sum 
test or other statistical tests as described herein, and those genes identified with a p value of < 
0.05 as between the patients with eczema as compared with patients without eczema are 
shown in Table 5F. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with eczema can be done using the differentially expressed genes as 
shown in Table 5F in combination with well known statistical algorithms for class prediction 
as would be understood by a person skilled in the art and is described herein. Commercially 
available programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class 
Prediction are also available. 



Manic Depression Syndrome 

This example demonstrates the use of the claimed invention to identify biomarkers of 
manic depression syndrome and use of same. 

As used herein, "Manic Depression Syndrome (MDS)" refers to a mood disorder 
characterized by alternating mania and depression. Whole blood samples were taken from 
patients who were diagnosed with manic depression as defined herein. In each case, the 
diagnosis of manic depression was corroborated by a skilled Board certified physician. RNA 
expression profiles of Whole blood samples from individuals who were identified as having 
manic depression as described herein as compared with RNA expression profiles from 
individuals not having manic depression, were generated using GeneSpring™ software 
analysis as described herein. Hybridizations to create said RNA expression profiles were 
done using Affymetrix® GeneChip® platforms (U133A and U133 Plus 2.0) platforms 
(U133A and U133 Plus 2.0) as described herein (data not shown). Samples are clustered 
and marked as representing patients who have manic depression or control individuals. 
Various experiments were performed as outlined above, and analyzed using either the Wilcox 
Maim Whitney rank sum test or other statistical tests as described herein, and those genes 
identified with a p value of < 0.05 as between the patients with manic depression syndrome 
as compared with patients without manic depression syndrome are shown in Table 51. 
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Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with manic depression syndrome can be done using the 
differentially expressed genes as shown in Table 51 in combination with well known 
statistical algorithms for class prediction as would be understood by a person skilled in the art 
5 and is described herein. Commercially available programs such as thoae provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 



Crohn's Colitis 

This example demonstrates the use of the claimed invention to identify biomarkers of 
10 Crohn's Colitis and use of same. 

As used herein, "Crohn's Colitis" is defined as a chronic granulomatous inflammatory 
disease of unknown etiology, involving any part of the gastrointestinal tract from mouth to 
anus, but commonly involving the terminal ileum with scarring and thickening of the bowel 
wall; it frequently leads to intestinal obstruction and fistula and abscess formation and has a 

15 high rate of recurrence after treatment, according to Borland's Illustrated Medical Dictionary. 
In each case, the diagnosis of Crohn's colitis was corroborated by a skilled Board certified 
physician. RNA expression profiles of Whole blood samples from individuals who were 
identified as having Crohn's colitis as described herein as compared with RNA expression 
profiles from individuals not having Crohn's colitis, were generated using GeneSpring™ 

20 software analysis as described herein. Hybridizations to create said RNA expression profiles 
were done using the Affymetrix® GeneChip® platforms (U133A and U133 Plus 2.0) 
platforms (U133A and U133 Plus 2.0) as described herein (data not shown).. Samples are 
clustered and marked as representing patients who have Crohn's colitis or control individuals. 
Various experiments were performed as outlined above, and analyzed using either the Wilcox 

25 Mann Whitney rank sum test or other statistical tests as described herein, and those genes 
identified with a p value of < 0.05 as between the patients with Crohn's colitis as compared 
with patients without Crohn's colitis are shown in Table 5 J. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with Crohn's colitis can be done using the differentially expressed 
30 genes as shown in Table 5 J in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
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Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 



5 Chronic cholecystitis 

This example demonstrates the use of the claimed invention to identify biomarkers of 
Chronic Cholecystitis and use of same. 

As used herein, "Chronic cholecystitis" is defined as chronic inflammation of the 
gallbladder, usually secondary to lithiasis, with lymphocytic infiltration and fibrosis that may 

L0 produce marked thickening of the wall, according to Stedman's Online Medical Dictionary, 
27th Edition. In each case, the diagnosis of chronic cholecystitis was corroborated by a 
skilled Board certified physician. RNA expression profiles of Whole blood samples from 
individuals who were identified as having chronic cholecystitis as described herein as 
compared with RNA expression profiles from individuals not having chronic cholecystitis, 

15 were generated using GeneSpring™ software analysis as described herein. Hybridizations to 
create said RNA expression profiles were done using the Affymetrix® GeneChip® platforms 
(U133A and U133 Plus 2.0) platforms (U133A and U133 Plus 2.0) as described herein (data 
not shown). Samples are clustered and marked as representing patients who have chronic 
cholecystitis or control individuals. Various experiments were performed as outlined above, 

20 and analyzed using either the Wilcox Mann Whitney rank sum test or other statistical tests as 
described herein, and those genes identified with a p value of < 0.05 as between the patients 
with chronic cholecystitis as compared with patients without chronic cholecystitis are shown 
in Table 5K. 

Classification or class prediction of a test sample from an unknown patient in order to 
25 diagnose said individual with chronic cholecystitis can be done using the differentially 

expressed genes as shown in Table 5K in combination with well known statistical algorithms 
for class prediction as would be understood by a person skilled in the art and is described 
herein. Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 



Cervical Cancer 
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This example demonstrates the use of the claimed invention to identify biomarkers of 
cervical cancer and use of same. 



As used herein, "Cervical Cancer" is defined as cancer of the uterine cervix, the 
portion of the uterus attached to the top of the vagina. Ninety percent of cervical cancers arise 
5 from the flattened or "squamous" cells covering the cervix. Most of the remaining 10% arise 
from the glandular, mucus-secreting cells of the cervical canal leading into the uterus. . In 
each case, the diagnosis of cervical cancer was corroborated by a skilled Board certified 
physician. RNA expression profiles of Whole blood samples from individuals who were 
identified as having cervical cancer as described herein as compared with RNA expression 

10 profiles from individuals not having cervical cancer, were generated using GeneSpring™ 

software analysis as described herein. Hybridizations to create said RNA expression profiles 
were done using Affymetrix® GeneChip® platforms (U133A and U133 Plus 2.0) platforms 
(U133A and U133 Plus 2.0) as described herein (data not shown). Samples are clustered 
and marked as representing patients who have cervical cancer or control individuals. Various 

15 experiments were performed as outlined above, and analyzed using either the Wilcox Mann 
Whitney rank sum test or other statistical tests as described herein, and those genes identified 
with a p value of < 0.05 as between the patients with cervical cancer as compared with 
patients without cervical cancer are shown in Table 5M. 

Classification or class prediction of a test sample from an unknown patient in order to 
20 diagnose said individual with cervical cancer can be done using the differentially expressed 
genes as shown in Table 5M in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 



Stomach Cancer 

This example demonstrates the use of the claimed invention to identify biomarkers of 
stomach cancer and use of same. 

30 As used herein, "Stomach Cancer" is defined as are malignacies of the stomach, the 

most common type being adenocarcinoma. Stomach is divided into. Cancer can develop in 
any of five different layers of the stomach. . In each case, the diagnosis of stomach cancer 
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was corroborated by a skilled Board certified physician. RNA expression profiles of Whole 
blood samples from individuals who were identified as having stomach cancer as described 
herein as compared with RNA expression profiles from individuals not having stomach 
cancer, were generated using GeneSpring™ software analysis as described herein. 

5 Hybridizations to create said RNA expression profiles were done using the Affymetrix® 

GeneChip® platforms (U133A and U133 Plus 2.0) platforms (U133A and U133 Plus 2.0) as 
described herein (data not shown). Samples are clustered and marked as representing 
patients who have stomach cancer or control individuals. Various experiments were 
performed as outlined above, and analyzed using either the Wilcox Mann Whitney rank sum 

10 test or other statistical tests as described herein, and those genes identified with a p value of < 
0.05 as between the patients with stomach cancer as compared with patients without stomach 
cancer are shown in Table 5N. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with stomach cancer can be done using the differentially expressed 
15 genes as shown in Table 5N in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 

20 

Kidney Cancer 

This example demonstrates the use of the claimed invention to identify biomarkers of 
kidney cancer and use of same. 

As used herein, "Kidney Cancer" is defined as are malignacies of the kidney, the most 
25 common type being renal cell carcinomaln each case, the diagnosis of kidney cancer was 
corroborated by a skilled Board certified physician. RNA expression profiles of Whole blood 
samples from individuals who were identified as having kidney cancer as described herein as 
compared with RNA expression profiles from individuals not having kidney cancer, were 
generated using GeneSpring™ software analysis as described herein. Hybridizations to 
30 create said RNA expression profiles were done using the Affymetrix® GeneChip® platforms 
(U133A and U133 Plus 2.0) platforms (U133A and U133 Plus 2.0) as described herein (data 
not shown).. Samples are clustered and marked as representing patients who have stomach 
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cancer or control individuals. Various experiments were performed as outlined above, and 
analyzed using either the Wilcox Mann Whitney rank sum test or other statistical tests as 
described herein, and those genes identified with a p value of < 0.05 as between the patients 
with kidney cancer as compared with patients without kidney cancer are shown in Table 50. 

5 Classification or class prediction of a test sample from an unknown patient in order to 

diagnose said individual with kidney cancer can be done using the differentially expressed 
genes as shown in Table SO in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
10 GeneSpring™) for Class Prediction are also available. 



Testicular Cancer 

This example demonstrates the use of the claimed invention to identify biomarkers of 
15 testicular cancer and use of same. 

As used herein, "Testicular Cancer" is defined as an abnormal, rapid, and invasive 
growth of cancerous (malignant) cells in the testicles. In each case, the diagnosis of testicular 
cancer was corroborated by a skilled Board certified physician. RNA expression profiles of 
Whole blood samples from individuals who were identified as having testicular cancer as 

20 described herein were compared with RNA expression profiles from individuals not having 
testicular cancer, using GeneSpring™ software analysis as described herein. Hybridizations 
to create said RNA expression profiles were done using the Affymetrix® GeneChip® 
platforms (U133A and U133 Plus 2.0) platforms (U133A and U133 Plus 2.0) as described 
herein (data not shown). Various experiments were performed as outlined above, and 

25 analyzed using either the Wilcox Mann Whitney rank sum test or other statistical tests as 
described herein, and those genes identified with a p value of < 0.05 as between the patients 
with testicular cancer as compared with patients without testicular cancer are shown in 
Table 5P 

Classification or class prediction of a test sample from an unknown patient in order to 
30 diagnose said individual with testicular cancer can be done using the differentially expressed 
genes as shown in Table 5P in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 



100 



WO 2004/112589 PCT/US2004/020836 

Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 



5 Colon Cancer 

This example demonstrates the use of the claimed invention to identify biomarkers of 
colon cancer and use of same. 

As used herein, "Colon Cancer" is defined as cancer of the colon and includes 
carcinoma, which arises from the lining of the large intestine, and lymphoma, melanoma, 

10 carcinoid tumors, and sarcomas. In each case, the diagnosis of colon cancer was corroborated 
by a skilled Board certified physician. RNA expression profiles of Whole blood samples 
from individuals who were identified as having colon cancer as described herein as compared 
with RNA expression profiles from individuals not having colon cancer, were generated 
using GeneSpring™ software analysis as described herein. Hybridizations to create said 

15 RNA expression profiles were done using Affymetrix® GeneChip® platforms (U133A and 
U133 Plus 2.0) platforms (U133A and U133 Plus 2.0) as described herein (data not shown). 
Various experiments were performed as outlined above, and analyzed using either the Wilcox 
Mann Whitney rank sum test or other statistical tests as described herein, and those genes 
identified with a p value of < 0.05 as between the patients with colon cancer as compared 

20 with patients without colon cancer are shown in Table 5Q. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with colon cancer can be done using the differentially expressed 
genes as shown in Table 5Q in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
25 Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 

Hepatitis B 

30 This example demonstrates the use of the claimed invention to identify biomarkers of 

hepatitis B and use of same. 
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As used herein, " Hepatitis B" is a serious disease caused by hepatitis B virus (HBV) that 
attacks human liver. The virus can cause lifelong infection, cirrhosis (scarring) of the liver, 
liver cancer, liver failure, and death. HBV is transmitted horizontally by blood and blood 
products and sexual transmission. It is also transmitted vertically from mother to infant in the 
perinatal period. . In each case, the diagnosis of hepatitis B was corroborated by a skilled 
Board certified physician. RNA expression profiles of Whole blood samples from 
individuals who were identified as having hepatitis as described herein as compared with 
RNA expression profiles from individuals not having hepatitis, were generated using 
GeneSpring™ software analysis as described herein. Hybridizations to create said RNA 
expression profiles were done using the Affymetrix® GeneChip® platforms (U133A and 
U133 Plus 2.0) platforms (U133A and U133 Plus 2.0) as described herein (data not shown). 
Samples are clustered and marked as representing patients who have hepatitis or control 
individuals. Various experiments were performed as outlined above, and analyzed using 
either the Wilcox Mann Whitney rank sum test or other statistical tests as described herein, 
and those genes identified with a p value of < 0.05 as between the patients with hepatitisas 
compared with patients without hepatitis are shown in Table 5R. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with hepatitis B can be done using the differentially expressed 
genes as shown in Table 5R in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 



Pancreatic Cancer 

This example demonstrates the use of the claimed invention to identify biomarkers of 
pancreatic cancer and use of same. 

As used herein, "Pancreatic Cancer" is defined as cancer of the colon and includes 
carcinoma, which arises from the lining of the large intestine, and lymphoma, melanoma, 
carcinoid tumors, and sarcomas. In each case, the diagnosis of pancreatic cancer was 
corroborated by a skilled Board certified physician. RNA expression profiles of Whole blood 
samples from individuals who were identified as having pancreatic cancer as described herein 
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as compared with RNA expression profiles from individuals not having pancreatic cancer, 
were generated using GeneSpring™ software analysis as described herein. Hybridizations to 
create said RNA expression profiles were done using the Affymetrix® GeneChip® platforms 
(U133AandU133 Plus 2.0) platforms (U133A and U133 Plus 2.0) as described herein (data 
5 not shown). Various experiments were performed as outlined above, and analyzed using 
either the Wilcox Mann Whitney rank sum test or other statistical tests as described herein, 
and those genes identified with a p value of < 0.05 as between the patients with pancreatic 
cancer as compared with patients without pancreatic cancer are shown in Table 5S. 

Classification or class prediction of a test sample from an unknown patient in order to 
10 diagnose said individual with pancreatic cancer can be done using the differentially expressed 
genes as shown in Table 5S in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 

15 

Nonalcoholic Steatohepatitis (NASH) 

This example demonstrates the use of the claimed invention to identify biomarkers of 
nonalcoholic steatohepatitis and use of same. 

As used herein, "nonalcoholic steatohepatitis", (NASH) is defined as an inflammatory 
20 disease of the liver associated with the accumulation of fat in the liver. NASH is of uncertain 
pathogenesis and histologically resembling alcoholic hepatitis, but occurring in nonalcoholic 
patients, most often obese women with non-insulin-dependent diabetes mellitus; clinically it 
is generally asymptomatic or mild, but fibrosis or cirrhosis may result. The diagnosis is 
confirmed by a liver biopsy. In each case, the diagnosis of nonalcoholic steatohepatitis was 
25 corroborated by a skilled Board certified physician. RNA expression profiles of Whole blood 
samples from individuals who were identified as having nonalcoholic steatohepatitis as 
described herein as compared with RNA expression profiles from individuals not having 
nonalcoholic steatohepatitis, were generated using GeneSpring™ software analysis as 
described herein. Hybridizations to create said RNA expression profiles were done using 
10 Affymetrix® GeneChip® platforms (U133A and U133 Plus 2.0) platforms (U133A and 
U133 Plus 2.0) as described herein (data not shown). Various experiments were 
performed as outlined above, and analyzed using either the Wilcox Mann Whitney rank sum 
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test or other statistical tests as described herein, and those genes identified with a p value of < 
0.05 as between the patients with nonalcoholic steatohepatitis as compared with patients 
without nonalcoholic steatohepatitis are shown in Table 5G. 

Classification or class prediction of a test sample from an unknown patient in order to 
5 diagnose said individual with NASH can be done using the differentially expressed genes as 
shown in Table 5G in combination with well known statistical algorithms for class prediction 
as would be understood by a person skilled in the art and is described herein. Commercially 
available programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class 
Prediction are also available. 



Alzheimer's Disease 

As used herein, "alzheimer's disease" refers to a degenerative disease of the central 
nervous system characterized especially by premature senile mental deterioration. In each 

15 case, the diagnosis of alzheimer's disease was corroborated by a skilled Board certified 
physician. RNA expression profiles of Whole blood samples from individuals who were 
identified as having Alzheimer's Diseaseas described herein as compared with RNA 
expression profiles from individuals not having alzheimer's disease, were generated using 
GeneSpring™ software analysis as described herein. Hybridizations to create said RNA 

20 expression profiles were done using the Affymetrixd> GeneChip® platforms (U133A and 
U133 Plus 2.0) platforms (U133A and U133 Plus 2.0) as described herein (data not shown). 
Samples are clustered and marked as representing patients who have alzheimer's disease or 
control individuals. Various experiments were performed as outlined above, and analyzed 
using either the Wilcox Mann Whitney rank sum test or other statistical tests as described 

25 herein, and those genes identified with a p value of < 0.05 as between the patients with 
alzheimer's disease as compared with patients without alzheimer's disease are shown in 
Table 5H. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with alzheimer's disease can be done using the differentially 
30 expressed genes as shown in Table 5H in combination with well known statistical algorithms 
for class prediction as would be understood by a person skilled in the art and is described 
herein. Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 
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Heart Failure 

This example demonstrates the use of the claimed invention to identify biomarkers of 
heart failure and use of same. 

As used herein, "heart failure" is defined as an inadequacy of the heart so that as a 
pump it fails to maintain the circulation of blood, with the result that congestion and edema 
develop in the tissues; Heart failure is synonymous with congestive heart failure, myocardial 
insufficiency, cardiac insufficiency, cardiac failure, and includes right ventricular failure, 
forward heart failure, backward heart failure and left ventricular failure. Resulting clinical 
syndromes include shortness of breath or nonpitting edema, enlarged tender liver, engorged 
neck veins, and pulmonary rales in various combinations. In each case, the diagnosis of heart 
failure was corroborated by a skilled Board certified physician. RNA expression profiles of 
Whole blood samples from individuals who were identified as having heart failure described 
herein as compared with RNA expression profiles from individuals not having hearat failure, 
were generated using GeneSpring™ software analysis as described herein. Hybridizations to 
create said RNA expression profiles were done using Affymetrix® GeneChip® platforms 
(U133A and U133 Plus 2.0) platforms (U133A and U133 Plus 2.0) as described herein (data 
not shown). Samples are clustered and marked as representing patients who have heart 
failure or control individuals. Various experiments were performed as outlined above, and 
analyzed using either the Wilcox Mann Whitney rank sum test or other statistical tests as 
described herein, and those genes identified with a p value of < 0.05 as between the patients 
with heart failure as compared with patients without heart failure are shown in Table 5L. 

Classification or class, prediction of a test sample from an unknown patient in order to 
diagnose said individual with heart failure can be done using the differentially expressed 
genes as shown in Table 5L in combination with well known statistical algorithms for class 
prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 



Ankylosing Spondylitis 
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This example demonstrates the use of the claimed invention to identify biomarkers of 
ankylosing spondylitis and use of same. 

As used herein "ankylosing spondylitis" refers to a chronic inflammatory disease that 
affects the joints between the vertebrae of the spine, and/or the joints between the spine and 

5 the pelvis and can eventually cause the affected vertebrae to fuse or grow together. In each 
case, the diagnosis of heart failure was corroborated by a skilled Board certified physician. 
RNA expression profiles of Whole blood samples from individuals who were identified as 
having heart failure described herein as compared with RNA expression profiles from 
individuals not having hearat failure, were generated using GeneSpring™ software analysis 

10 as described herein. Hybridizations to create said RNA expression profiles were done using 
the Affymetrix® GeneChip® platforms (U133A and U133 Plus 2.0) platforms (U133A and 
U133 Plus 2.0) as described herein (data not shown). Samples are clustered and marked as 
representing patients who have ankylosing spondylitis or control individuals. Various 
experiments were performed as outlined above, and analyzed using either the Wilcox Mann 

15 Whitney rank sum test or other statistical tests as described herein, and those genes identified 
with a p value of < 0.05 as between the patients with ankylosing spondylitis as compared 
with patients without ankylosing spondylitis are shown in Table 1 AL 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with ankylosing spondylitis can be done using the differentially 
20 expressed genes as shown in Table 1 AI in combination with well known statistical algorithms 
for class prediction as would be understood by a person skilled in the art and is described 
herein. Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 

25 

EXAMPLE 3 

In addition to methods to identify biomarkers associated with a condition, this 
invention also includes methods to identify biomarkers that can identify markers for a 
30 condition in an individual or group of individuals, despite the presence of one or more second 
conditions in the same individual or group of individuals. The invention also includes 
methods to identify biomarkers of a co-morbid condition. The following examples illustrate 
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embodiments of methods comprising individuals presenting with Ostearthritis and various 
second conditions, but the invention is not limited to these sets of examples. 



Osteoarthritis and Hypertension 

5 ChondroChip™ Microarray Data Analysis of RNA expression profiles of Whole 

blood samples from co-morbid individuals having osteoarthritis and hypertension as 
compared with RNA expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect biomarkers of 
patients with osteoarthritis and hypertension or use of same. 

10 Whole blood samples were taken from patients who were diagnosed with 

osteoarthritis and hypertension as defined herein. RNA expression profiles were then 
analysed and compared to profiles from patients unaffected by any disease. In each case, the 
diagnosis of osteoarthritis and hypertension was corroborated by a skilled Board certified 
physician . 

15 Total mRNA from whole blood wasisolated from each patient was isolated using 

TRIzol® reagent (GIB CO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a 15K 
Chondrogene Microarray Chip (ChondroChip™) as described herein. Identification of genes 
differentially expressed in Whole blood samples from patients with disease as compared to 

20 healthy patients was determined by statistical analysis using the Wilcox Mann Whitney rank 
sum test (Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical 
Publishing Division, 2002). 

Figure 1 shows a diagrammatic representation of RNA expression profiles of Whole 
blood samples from individuals having hypertension and osteoarthritis as compared with 

25 RNA expression profiles from normal individuals. Expression profiles were generated using 
GeneSpring™ software analysis as described herein. Each column represents the 
hybridization pattern resulting from a single individual. In this example, hypertensive 
patients also presented with OA, as described herein. Normal individuals have no known 
medical conditions and were not taking any known medication. Hybridizations to create said 

30 RNA expression profiles were done using the ChondroChip™. A dendogram analysis is 
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shown above. Samples are clustered and marked as representing patients who are 
hypertensive or normal. The indicates those patients who abnormally clustered as either 
hypertensive, or normal despite presenting with the reverse. The number of hybridizations 
profiles determined for either hypertensive patients or normal individuals are shown. 861 
5 differentially expressed genes were identified as being differentially expressed with a p value 
of < 0.05 as between the hypertensive patients and normal individuals. The identity of the 
differentially expressed genes is shown in Table 1 A. 

Classification or class prediction of a test sample as having hypertension and OA or 
being normal can be done using the differentially expressed genes as shown in Table 1 A in 
10 combination with well known statistical algorithms for class prediction as would be 
understood by a person skilled in the art and described herein. Commercially available 
programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class 
Predication are also available. 



15 Osteoarthritis and Obesity 

ChondroChip™ Microarray Data Analysis of RNA expression profiles of Whole 
blood samples from co-morbid individuals to identify biomarkers of osteoarthritis and 
obesityRNA expression profiles. 

This example demonstrates the use of the claimed invention to detect differential gene 
20 expression in Whole blood samples taken from patients with obesity and OA as compared to 
Whole blood samples taken from healthy patients. 

Whole blood samples were taken from patients who were diagnosed with 
osteoarthritis and obesity as defined herein. RNA expression profiles were then analysed and 
compared to profiles from patients unaffected by any disease. In each case, the diagnosis of 
the disease was corroborated by a skilled Board certified physician. Total mRNA from a 
blood taken from each patient was isolated using TRIzol® reagent (GIBCO) and 
fluorescently labeled probes for each blood sample were generated as described above. Each 
probe was denatured and hybridized to a 15K Chondrogene Microarray Chip 
(ChondroChip™) as described herein. Identification of genes differentially expressed in 
Whole blood samples from patients with disease as compared to healthy patients was 
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determined by statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz S A. 
Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical Publishing Division, 
2002). 

Figure 2 shows a diagrammatic representation of RNA expression profiles of Whole 
5 blood samples from individuals who were identified as obese as described herein as 

compared with RNA expression profiles from normal individuals. Expression profiles were 
generated using GeneSpring™ software analysis as described herein. Each column 
represents the hybridization pattern resulting from a single individual. In this example, 
obese patients also presented with OA, as described herein. Normal individuals have no 

10 known medical conditions and were not taking any known medication. Hybridizations to 
create said RNA expression profiles were done using the ChondroChip™. A dendogram 
analysis is shown above. Samples are clustered and marked as representing patients who 
are obese or normal. The "*" indicates those patients who abnormally clustered as either 
obese or normal despite presenting with the reverse. The number of hybridization profiles 

15 determined for obese patients with OA and normal individuals are shown. 913 genes were 
identified as being differentially expressed with a p value of < 0.05 as between the obese 
patients with OA and normal individuals is noted. The identity of the differentially 
expressed genes is shown in Table IB. 

Classification or class prediction of a test sample as either having obesity and OA or 
20 being normal can be done using the differentially expressed genes as shown in Table IB in 
combination with well known statistical algorithms as would be understood by a person 
skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 



25 Osteoarthritis and Allergies 

ChondroChip™ Microarray Data Analysis of RNA expression profiles of Whole 
blood samples from co-morbid individuals having osteoarthritis and allergies as compared 
with RNA expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential 
30 biomarkers of osteoarthritis and allergies. 
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Whole blood samples were taken from patients who were diagnosed with 
osteoarthritis and allergies as defined herein. These patients are classified as presenting with 
co-morbidity, or multiple disease states. RNA expression profiles were then analysed and 
compared to profiles from patients unaffected by any disease, hi each case, the diagnosis of 
5 osteoarthritis and allergies was corroborated by a skilled Board certified physician. 

Total mRNA from blood taken from each patient was isolated using TRIzol® reagent 
(GIBCO) and fluorescently labeled probes for each blood sample were generated as described 
above. Each probe was denatured and hybridized to a 15K Chondrogene Microarray Chip 
(ChondroChip™) as described Herein. Identification of genes differentially expressed in 
10 Whole blood samples from patients with osteoarthritis and allergies as compared to healthy 
patients was determined by statistical analysis using the Wilcox Mann Whitney rank sum test 
(Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical 
Publishing Division, 2002). 

Figure 3 shows a diagrammatic representation of RNA expression profiles of Whole 
blood samples from individuals who were identified as having allergies as described herein 
as compared with RNA expression profiles from normal individuals. Expression profiles 
were generated using GeneSpring™ software analysis as described herein. Each column 
represents the hybridization pattern resulting from a single individual. In this example, 
patients with allergies also presented with OA, as described herein. Normal individuals had 
no known medical conditions and were not taking any known medication. Hybridizations 
to create said RNA expression profiles were done using the ChondroChip™ . A 
dendogram analysis is shown above. Samples are clustered and marked as representing 
patients who are obese or normal. The indicates those patients who abnormally 
clustered as either having allergies or being normal despite presenting with the reverse. The 
number of hybridizations profiles determined for patients with allergies and normal 
individuals are shown. 633 genes were identified as being differentially expressed with a p 
value of < 0.05 as between patients with allergies and normal individuals is noted. The 
identity of the differentially expressed genes is shown in Table 1C. 

Classification or class prediction of a test sample to determine whether said individual 
30 has allergies and OA or is normal can be doneusing the differentially expressed genes as 
shown in Table 1C in combination with well known statistical algorithms as would be 
understood by a person skilled in the art and described herein. Commercially available 
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programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class 
Predication are also available. 



Osteoarthritis and Systemic Steroids 

ChondroChip™ Microarray Data Analysis of RNA expression profiles of Whole 
5 blood samples from co-morbid individuals having osteoarthritis and subject to systemic 
steroids as compared with RNA expression profiles from normal individuals 

This example demonstrates the use of the claimed invention to detect biomarkers in 
blood of patients subject to systemic steroids and having osteoarthritis. 

As used herein, "systemic steroids" indicates a person subjected to artificial levels of 
10 steroids as a result of medical intervention. Such systemic steroids include birth control pills, 
prednisone, and hormones as a result of hormone replacement treatment. A person identified 
as having systemic steroids is one who is on one or more of the following of the above 
treatment regimes. 

Whole blood samples were taken from patients who were diagnosed with 
15 osteoarthritis and subject to systemic steroids as defined herein. RNA expression profiles 

were then analysed and compared to profiles from patients unaffected by any disease. In each 
case, the diagnosis of osteoarthritis and systemic steroids was corroborated by a skilled Board 
certified physician. 

Total mRNA from blood taken from each patient was isolated using TRIzol® reagent 
20 (GIBCO) and fluorescently labeled probes for each blood sample were generated as described 
above. Each probe was denatured and hybridized to the 15K Chondrogene Microarray Chip 
(ChondroChip™) as described herein. Identification of genes differentially expressed in 
Whole blood samples from patients with osteoarthritis and subject to systemic steroids as 
compared to healthy patients was determined by statistical analysis using the Wilcox Mann 
25 Whitney rank sum test (Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: 
McGraw-Hill Medical Publishing Division, 2002). 

Figure 4 shows a diagrammatic representation of RNA expression profiles of Whole 
blood samples from individuals who were subject to systemic steroids as described herein 
as compared with RNA expression profiles from normal individuals. Expression profiles 
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were generated using GeneSpring™ software analysis as described herein. Each column 
represents the hybridization pattern resulting from a single individual. In this example, 
patients taking systemic steroids also presented with OA, as described herein. Normal 
individuals have no known medical conditions and were not taking any known medication. 

5 Hybridizations to create said RNA expression profiles were done using the ChondroChip™ 
. (A dendogram analysis is shown above. Samples are clustered and marked as 
representing patients who are taking systemic steroids or normal. The indicates those 
patients who abnormally clustered as either systemic steroids or normal despite presenting 
with the reverse. The number of hybridizations profiles determined for patients with 

10 systemic steroids and normal individuals are shown. 605 genes were identified as being 
differentially expressed with a p value of < 0.05 as between patients with systemic steroids 
and normal individuals is noted. The identity of the differentially expressed genes is shown 
in Table ID. 

Classification or class prediction of a test sample from a patient as indicating said 
15 patient takes systemic steroids and has OA or as being normal can be done using the 
differentially expressed genes as shown in Table ID in combination with well known 
statistical algorithms for class prediction as would be understood by a person skilled in the art 
and is described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Predication are also available. 

20 

Osteoarthritis and Hypertension Compared with Osteoarthritis Only 

ChondroChip™ Microarray Data Analysis of RNA expression profiles of Whole 
blood samples from individuals having osteoarthritis and hypertension as compared with 
RNA expression profiles from patients having osteoarthritis only. 

25 This example demonstrates the use of the claimed invention to identify biomarkersin 

Whole blood samples which are specific to hypertension by comparing gene expression in 
blood from co-morbid patients with osteoarthritis and hypertension to Whole blood samples 
taken from OA patients only. 

Whole blood samples were taken from patients who were diagnosed with 
30 osteoarthritis and hypertension as defined herein. RNA expression profiles were then 
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analysed and compared to profiles from patients having OA only. In each case, the diagnosis 
of osteoarthritis and/or hypertension was corroborated by a skilled Board certified physician . 



Total mRNA from blood taken from each patient was isolated using TRIzol® reagent 
(GIB CO) and fluorescently labeled probes for each blood sample were generated as described 
5 above. Each probe was denatured and hybridized to a 15K Chondrogene Microarray Chip 
(ChondroChip™) as described herein. Identification of genes differentially expressed in 
Whole blood samples from patients with disease as compared to OA patients only was 
determined by statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz S A. 
Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical Publishing Division, 
10 2002). 

Expression profiles were generated using GeneSpring™ software analysis as 
described herein (data not shown). The gene list generated from this analysis was identified 
and those genes previously identified in Table 1 A removed so as to identify those genes 
which are unique to hypertension. 790 differentially expressed genes were identified as being 

15 differentially expressed with a p value of < 0.05 as between the OA and hypertensive patients 
when compared with OA individuals. 577 genes were identified as unique to hypertension. 
The identity of these differentially expressed genes are shown in Table 1G. A gene list is 
also provided of the 213 genes which were found in common as between those genes 
identified in Table 1 A and genes differentially expressed in Whole blood samples taken from 

>0 patients with osteoarthritis and hypertension as compared to Whole blood samples taken from 
OA patients only. The identity of these intersecting differentially expressed genes is shown 
in Table 1H and a venn diagram showing the relationship between the various groups of gene 
lists is found in Figure 7. 

Classification or class prediction of a test sample as having hypertension or not 
15 having hypertension can be done using the differentially expressed genes as shown in Table 
1G as the predictor genes in combination with well known statistical algorithms as would be 
understood by a person skilled in the art and described herein. Commercially available 
programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class 
Predication are also available. Classification of individuals as having both OA and 
>0 hypertension using the genes in Table 1H can also be performed. 

Osteoarthritis and Obesity Compared with Osteoarthritis Only 
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ChondroChip™ Microarray Data Analysis of RNA expression profiles of 
Whole blood samples from co-morbid individuals having osteoarthritis and obesity as 
compared with RNA expression profiles from patients having osteoarthritis only. 

This example demonstrates the use of the claimed invention to identify biomarkers in 
5 Whole blood samples which are specific to obesity by comparing gene expression in blood 
from co-morbid patients with osteoarthritis and obesity to Whole blood samples taken from 
OA patients only. 

Whole blood samples were taken from patients who were diagnosed with 
osteoarthritis and obesity as defined herein. RNA expression profiles were then analysed and 
10 compared to profiles from patients affected by OA only. 

In each case, the diagnosis of the disease was corroborated by a skilled Board 
certified physician. Total mRNA from blood taken from each patient was isolated using 
TRIzol® reagent (GIB CO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a 15K 
15 Chondrogene Microarray Chip (ChondroChip™) as described herein. Identification of genes 
differentially expressed in Whole blood samples from patients with obesity and OA as 
compared to OA patients only was determined by statistical analysis using the Wilcox Mann 
Whitney rank sum test (Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: 
McGraw-Hill Medical Publishing Division, 2002). 

Expression profiles were generated using GeneSpring™ software analysis as 
described herein (data not shown). 671 genes were identified as being differentially 
expressed with a p value of < 0.05 as between the obese patients with OA and those patients 
with only OA. Those genes previously identified in Table IB were removed so as to identify 
those genes which are unique to obesity. The identity of these 519 genes unique to obesity 
are shown in Table II. A gene list is also provided of those genes which were found in 
common as*between those genes identified in Table IB and genes differentially expressed in 
Whole blood samples taken from patients with osteoarthritis and obesity as compared to 
Whole blood samples taken from OA patients only. 152 genes are shown in Table 1 J. A 
venn diagram showing the relationship between the various groups of gene lists is found in 
Figure 8. 
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Classification or class prediction of a test sample as having obesity or not having obesity 
can be done using the differentially expressed genes as shown in Table II as the predictor 
genes in combination with well known statistical algorithms as would be understood by a 
person skilled in the art and described herein. Commercially available programs such as 
5 those provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also 
available. Classification of individuals as having both OA and obesity using the genes in 
Table 1 J can also be performed.Osteoarthritis and Allergies Compared with 
Osteoarthritis Only 

ChondroChip™ Microarray Data Analysis of RNA expression profiles of Whole 
10 blood samples from individuals having osteoarthritis (OA) and allergies as compared with 
RNA expression profiles from individuals with OA only. 

This example demonstrates the use of the claimed invention to identify biomarkers in 
Whole blood samples which are specific to allergies by comparing gene expression in blood 
from co-morbid patients with osteoarthritis and allergies to Whole blood samples taken from 
1 5 OA patients only. 

Whole blood samples were taken from patients who were diagnosed with 
osteoarthritis and allergies as defined herein. RNA expression profiles were then analysed 
and compared to profiles from patients affected by OA only. In each case, the diagnosis of 
osteoarthritis and allergies was corroborated by a skilled Board certified physician. 

20 Total mRNA from blood taken from each patient was isolated using TRIzol® reagent 

(GIBCO) and fluorescently labeled probes for each blood sample were generated as described 
above. Each probe was denatured and hybridized to a 15K Chondrogene Microarray Chip 
(ChondroChip™) as described herein. Identification of genes differentially expressed in 
Whole blood samples from patients with osteoarthritis and allergies as compared to OA 

25 patients only was determined by statistical analysis using the Wilcox Mann Whitney rank 
sum test (Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical 
Publishing Division, 2002). 

Expression profiles were generated using GeneSpring™ software analysis as 
described herein (data not shown). 498 genes were identified as being differentially 
30 expressed with a p value of < 0.05 as between patients with allergies and OA as compared 
with patients with OA only. Of the 498 genes identified, those genes previously identified in 
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Table 1C were removed so as to identify those genes which are unique to allergies. 257 
differentially expressed genes were identified as being as unique to allergies. The identity of 
these differentially expressed genes are shown in Table IK. A gene list is also provided of 
the 241 genes which were found in common as between those genes identified in Table 3C 
5 and genes differentially expressed in Whole blood samples taken from patients with 
osteoarthritis and allergies as compared to Whole blood samples taken from OA patients 
only. The identity of these intersecting differentially expressed genes is shown in Table 1L 
and a venn diagram showing the relationship between the various groups of gene lists is 
found in Figure 9. 

10 Classification or class prediction of a test sample as having allergies or not having 

allergies can be done using the differentially expressed genes as shown in Table IK as the 
predictor genes in combination with well known statistical algorithms as would be 
understood by a person skilled in the art and described herein. Commercially available 
programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class 

15 Predication are also available. Classification of individuals as having both OA and allergies 
using the genes in Table 1L can also be performed. 



RNA expression profilesRNA expression profilesRNA expression profilesRNA expression 
profilesRNA expression profilesRNA expression profilesOsteoarthritis and Systemic 
20 Steroids Compared with Osteoarthritis Only 

ChondroChip™ Microarray Data Analysis of RNA expression profiles of Whole 
blood samples from co-morbid individuals having osteoarthritis and subject to systemic 
steroids as compared with RNA expression profiles from with osteoarthritis only. 

This example demonstrates the use of the claimed invention to identify biomarkers in 
25 Whole blood samples which are specific to systemic steroids by comparing gene expression 
in blood from co-morbid patients with osteoarthritis and systemic steroids to Whole blood 
samples taken from OA patients only. 

Whole blood samples were taken from patients who were diagnosed with 
osteoarthritis and subject to systemic steroids as defined herein. RNA expression profiles 
30 were then analysed and compared to profiles from patients having OA only. In each case, the 
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diagnosis of osteoarthritis and systemic steroids was corroborated by a skilled Board certified 
physician. 



Total mRNA from blood was taken from each patient was isolated using TRIzol® 
reagent (GIBCO) and fluorescently labeled probes for each blood sample were generated as 

5 described above. Each probe was denatured and hybridized to the 15K Chondrogene 

Microarray Chip (ChondroChip™) as described herein. Identification of genes differentially 
expressed in Whole blood samples from patients with osteoarthritis and subject to systemic 
steroids as compared patients with OA only was determined by statistical analysis using the 
Wilcox Mann Whitney rank sum test (Glantz SA. Primer of Biostatistics. 5th ed. New York, 

10 USA: McGraw-Hill Medical Publishing Division, 2002). 

Expression profiles were generated using GeneSpring™ software analysis as 
described herein (data not shown). 553 genes were identified as being differentially 
expressed with a p value of < 0.05 as between patients taking systemic steroids and OA as 
compared with patients with OA only. Of the 553 genes identified, those genes previously 

15 identified in Table ID were removed so as to identify those genes which are unique to 

systemic steroids. 362 differentially expressed genes were identified as being as unique to 
systemic steroids. The identity of these differentially expressed genes are shown in Table 
1M. A gene list is also provided of the 191 genes which were found in common as between 
those genes identified in Table 3D and genes differentially expressed in Whole blood samples 

20 taken from patients with osteoarthritis and systemic steroids as compared to Whole blood 
samples taken from OA patients only. The identity of these intersecting differentially 
expressed genes is shown in Table IN and a venn diagram showing the relationship between 
the various groups of gene lists is found in Figure 10. 

Classification or class prediction of a test sample of an individual as either taking 
25 systemic steroids or not taking systemic steroids can be done using the differentially 
expressed genes as shown in Table 1M as the predictor genes in combination with well 
known statistical algorithms as would be understood by a person skilled in the art and 
described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Predication are also available. Classification of 
30 individuals as having both OA and taking systemic steroids using the genes in Table IN can 
also be performed. 
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Osteoarthritis and Systemic Steroids Compared with Normal so as to Differentiate 
Between Types of Systemic Steroids. 

ChondroChip™ Microarray Data Analysis of RNA expression profiles of Whole 
5 blood samples from co-morbid individuals having osteoarthritis and subject to systemic 
steroids as compared with RNA expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to identify biomarkers in 
Whole blood samples which are specific to individual types of systemic steroids by 
comparing gene expression in blood from co-morbid patients with osteoarthritis and either 
10 on prednisone, birth control pills or taking hormones to Whole blood samples taken from OA 
patients only. 

As used herein, "systemic steroids" indicates a person subjected to artificial levels of 
steroids as a result of medical intervention. Such systemic steroids include birth control pills, 
prednisone, and hormones as a result of hormone replacement treatment. A person identified 
15 as having systemic steroids is one who is on one or more of the following of the above 
treatment regimes. 

Whole blood samples were taken from patients who were diagnosed with 
osteoarthritis and subject to systemic steroids as defined herein. RNA expression profiles 
were then analysed and compared as between the systemic steroids as compared to profiles 
20 from patients unaffected by any disease. In each case, the diagnosis of osteoarthritis and 
systemic steroids was corroborated by a skilled Board certified physician. 

Total mRNA from blood taken from each patient was isolated using TRIzol® reagent 
(GIBCO) and fluorescently labeled probes for each blood sample were generated as described 
above. Each probe was denatured and hybridized to the 15K Chondrogene Microarray Chip 
25 (ChondroChip™) as described herein. Identification of genes differentially expressed in 
Whole blood samples from patients with osteoarthritis and subject to systemic steroids as 
compared to healthy patients was determined by statistical analysis using the Wilcox Mann 
Whitney rank sum test (Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: 
McGraw-Hill Medical Publishing Division, 2002). 
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Figure 1 1 shows a diagrammatic representation of RNA expression profiles of 
Whole blood samples from individuals who were subject to either birth control, prednisone, 
or hormone replacement therapy as described herein as compared with RNA expression 
profiles from normal individuals. Expression profiles were generated using GeneSpring™ 
software analysis as described herein. Each column represents the hybridization pattern 
resulting from a single individual. In this example, patients taking with each of the 
systemic steroids also presented with OA, as described herein. Normal individuals have no 
known medical conditions and were not taking any known medication. Hybridizations to 
create said RNA expression profiles were done using the ChondroChip™. A dendogram 
analysis is shown above. Samples are clustered and marked as representing patients who 
are taking birth control, prednisone, hormone replacement therapy or normal. The "*" 
indicates those patients who abnormally clustered. The number of hybridizations profiles 
determined for patients with birth control, prednisone, hormone replacement therapy or 
normal individuals are shown. 396 genes were identified as being differentially expressed 
with a p value of < 0.05 as between patients with systemic steroids and normal individuals 
is noted. The identity of the differentially expressed genes is shown in Table lO. 

Classification or class prediction of a test sample from a patient as indicating said 
patient takes systemic steroids and has OA or as being normal can be done using the 
differentially expressed genes as shown in Table lO in combination with well known 
20 statistical algorithms for class prediction as would be understood by a person skilled in the art 
and is described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Predication are also available. 

Osteoarthritis and Asthma Compared with Osteoarthritis Only 

ChondroChip™ Microarray Data Analysis of RNA expression profiles of Whole 
25 blood samples from individuals having osteoarthritis (OA) and asthma as compared with 
RNA expression profiles from individuals with OA only. 

This example demonstrates the use of the claimed invention to identify biomarkers in 
Whole blood samples which are specific to asthma. 

Whole blood samples were taken from patients who were diagnosed with 
30 osteoarthritis and asthma as defined herein. RNA expression profiles were then analysed and 
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compared to profiles from patients affected by asthma only. In each case, the diagnosis of 
osteoarthritis and asthma was corroborated by a skilled Board certified physician. 



Total mRNA from blood was taken from each patient was isolated using TRIzol® 
reagent (GIBCO) and fluorescently labeled probes for each blood sample were generated as 

5 described above. Each probe was denatured and hybridized to a 15K Chondrogene 

Microarray Chip (ChondroChip™) as described herein. Identification of genes differentially 
expressed in Whole blood samples from patients with osteoarthritis and asthma as compared 
to OA patients only was determined by statistical analysis using the Wilcox Mann Whitney 
rank sum test (Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill 

L0 Medical Publishing Division, 2002). Figure 24 shows a diagrammatic representation of RNA 
expression profiles of Whole blood samples from individuals who had asthma and 
osteoarthritis as described herein as compared with RNA expression profiles from 
osteoarthritic individuals. Expression profiles were generated using GeneSpring™ software 
analysis as described herein. Each column represents the hybridization pattern resulting from 

15 a single individual. Hybridizations to create said RNA expression profiles were done using 
the ChondroChip™ and the Affymetrix Chip. (A dendogram analysis is shown above). 
Samples are clustered and marked as representing patients who have asthma and OA or those 
patients who have just OA. The number of hybridizations profiles determined for patients 
with asthma and patients without asthma are shown. Various experiments were performed 

20 using the ChondroChip™as outlined above and analyzed using either the Wilcox Mann 

Whitney rank sum test, or other statistical tests as described herein, and those genes identified 
with a p value of < 0.05 as between the patients with asthma and OA and patients with just 
OA are shown in Table 1 AC. Additionally experiments were performed using the 
Affymetrix® GeneChip® platforms (U133A and U133 Plus 2.0) platforms (U133A and 

25 U133 Plus 2.0) as described herein (data not shown) using either the Wilcox Maim Whitney 
rank sum test, or other statistical tests as described herein, and those genes identified with a p 
value of < 0.05 as between the patients with asthma and without asthma are shown in Table 
IAD . 

EXAMPLE 4 

30 In addition to methods to identify biomarkers associated with a specific disease or 

condition, this invention also includes methods to identify biomarkers that distinguish 
between different stages of the condition. The following examples illustrate embodiments 
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of the application of the instant methods as applied to identifiying biomarkers associated with 
specific stages of bladder cancer and osteoarthritis, however, this aspect of the invention is 
not limited to these particular conditions. 

Bladder Cancer 

5 Affymetrix Chip Microarray Data Analysis of RNA expression profiles of Whole 

blood samples from individuals having early or advanced bladder cancer as compared with 
RNA expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to identify biomarkers in 
Whole blood samples which are specific to a stage of bladder cancer by comparing gene 
10 expression in blood from individuals with advanced bladder cancer and those without 
bladder cancer. 

As used herein, "early stage bladder cancer" includes bladder cancer wherein the 
detection of the anatomic extent of the tumor, both in its primary location and in metastatic 
sites, as defined by the TNM staging system in accordance with Harrison's Principles of 
15 Internal Medicine 14th edition can be considered early stage. More specifically, early stage 
bladder cancer can include those instances wherein the carcinoma is mainly superficial. 

As used herein, "advanced stage bladder cancer" is defined as bladder cancer wherein 
the detection of the anatomic extent of the tumor, both in its primary location and in 
metastatic sites, as defined by the TNM staging system in accordance with Harrison's 
20 Principles of Internal Medicine 14th edition, can be considered as advanced stage. More 
specifically, advanced stage carcinomas can involve instances wherein the cancer has 
infiltrated the muscle and wherein metastasis has occurred. 

Whole blood samples were taken from patients who were diagnosed with early or 
advanced late stage bladder cancer as defined herein. RNA expression profiles were then 
25 analysed and compared to profiles from patients unaffected by any disease. In each case, the 
diagnosis of early or advanced late stage bladder cancer was corroborated by a skilled Board 
certified physician . 

Total mRNA from ablood sample taken from each patient was isolated using TRIzol® 
reagent (GIBCO) and fluorescently labeled probes for each blood sample were generated as 
30 described above. Each probe was denatured and hybridized to a Affymetrix U133A Chip as 
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described herein. Identification of genes differentially expressed in Whole blood samples 
from patients with early or advanced late stage bladder cancer as compared to healthy 
patients was determined by statistical analysis using the Wilcox Mann Whitney rank sum test 
(Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: McGraw-Hill Medical 
5 Publishing Division, 2002). 

Figure 16 shows a diagrammatic representation of RNA expression profiles of Whole blood 
samples from individuals who were identified as having advanced stage bladder cancer or 
early stage bladder cancer as described herein as compared with RNA expression profiles 
from non bladder cancer individuals. Expression profiles were generated using 

0 GeneSpring™ software analysis as described herein. Each column represents the 

hybridization pattern resulting from a single individual. Non bladder cancer individuals 
presented without bladder cancer, but may have presented with other medical conditions and 
may be under various treatment regimes. Hybridizations to create said RNA expression 
profiles were done using the Affymetrix U133A chip. A dendogram analysis is shown above. 

15 Samples are clustered and marked as representing patients who have early stage bladder 
cancer, advanced stage bladder cancer, or do not have bladder cancer. The "*" indicates 
those patients who abnormally clustered despite actual presentation. The number of 
hybridizations profiles determined for either early stage bladder cancer, advanced bladder 
cancer or non-bladder cancer are shown. 3,518 genes were identified as being differentially 

20 expressed with a p value of < 0.05 using an ANOVA analysis. The identity of the 

differentially expressed genes identified is shown in Table IT. Various experiments were 
also performed as outlined above, and analyzed using either the Wilcox Mann Whitney rank 
sum test or other statistical tests as described herein, and those genes identified with a p value 
of < 0.05 as between the patients with any stage of advanced bladder cancer as compared 

25 with patients without bladder cancer are shown in Table 5V. 

Classification or class prediction of a test sample of an individual to determine 
whether said individual has advanced bladder cancer, early stage bladder cancer or does not 
have bladder cancer can be done using the differentially expressed genes as shown in Table 
30 IT and/or 5V as the predictor genes in combination with well known statistical algorithms as 
would be understood by a person skilled in the art and described herein. Commercially 
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available programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class 
Predication are also available. 

Osteoarthritis Staging 

This example demonstrates the use of the claimed invention to identify 
5 biomarkers in Whole blood samples which are specific various stages of osteoarthritis so as 
to allow the monitoring (progression or regression) of disease. 

Osteoarthritis (OA), as used herein also known as "degenerative joint disease", represents 
failure of a diarthrodial (movable, synovial-lined) joint. It is a condition, which affects joint 
cartilage, and or subsequently underlying bone and supporting tissues leading to pain, 
10 stiffness, movement problems and activity limitations. It most often affects the hip, knee, 
foot, and hand, but can affect other joints as well. 

OA severity can be graded according to the system described by Marshall (Marshall 
KW. J Rheumatol, 1996:23(4) 582-85). Briefly, each of the six knee articular surfaces was 
assigned a cartilage grade with points based on the worst lesion seen on each particular 
surface. Grade 0 is normal (0 points), Grade I cartilage is soft or swollen but the articular 
surface is intact (1 point). In Grade II lesions, the cartilage surface is not intact but the lesion 
does not extend down to subchondral bone (2 points). Grade III damage extends to 
subchondral bone but the bone is neither eroded nor eburnated (3 points). In Grade IV 
lesions, there is eburnation of or erosion into bone (4 points). A global OA score is 
calculated by summing the points from all six cartilage surfaces. If there is any associated 
pathology, such as meniscus tear, an extra point will be added to the global score. Based on 
the total score, each patient is then categorized into one of four OA groups: mild (1-6), 
moderate (7-12), marked (13-18), and severe (>18). As used herein, patients identified with 
OA may be categorized in any of the four OA groupings as described above. 

25 Whole blood samples were taken from patients who were diagnosed with 

osteoarthritis and a specific stage of osteoarthritis as defined herein. RNA expression profiles 
were then analysed and compared to profiles from patients unaffected by any disease. In 
each case, the diagnosis of osteoarthritis and the stage of osteoarthritis was corroborated by a 
skilled Board certified physician . 
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Total mRNA from a blood sample taken from each patient was isolated using 
TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a 15K 
Chondrogene Microarray Chip (ChondroChip™) as described herein. Identification of genes 
differentially expressed in Whole blood samples from patients with disease as compared to 
healthy patients was determined by statistical analysis using the Wilcox Mann Whitney rank 
sum test (Glantz SA. Primer of Biostatistics., 5th ed. New York, USA: McGraw-Hill Medical 
Publishing Division, 2002). 

Figure 20 shows a diagrammatic representation of RNA expression profiles of Whole 
blood samples from individuals having osteoarthritis as compared with RNA expression 
profiles from normal individuals. Expression profiles were generated using GeneSpring™ 
software analysis as described herein. Each column represents the hybridization pattern 
resulting from a single individual. Normal individuals have no known medical conditions 
and were not taking any known medication. Hybridizations to create said RNA expression 
profiles were done using the ChondroChip™ and the Affymetrix™ Chip. A dendogram 
analysis is shown above. Samples are clustered and marked as representing patients who 
presented with different stages of osteoarthritis or normal. The "*" indicates those patients 
who abnormally clustered despite actual presentation. The number of hybridizations profiles 
determined for either osteoarthritis patients or normal individuals are shown. Differentially 
expressed genes were identified as being differentially expressed using ANOVA analysis and 
those genes with a p value of < 0.05 identified. The identity of the differentially expressed 
genes is shown in Tables 1 Y. In addition, various experiments were also performed as 
outlined above, and analyzed using either the Wilcox Mann Whitney rank sum test or other 
statistical tests as described herein, and using a pairwise comparison, those genes identified 
with a p value of < 0.05 as between the patients with any stage of osteoarthritis as compared 
with patients without osteoarthritis are shown in Table 4 A and 4B. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with osteoarthritis can be done using the differentially expressed 
genes as shown in Table 4A and 4B in combination with well known statistical algorithms for 
class prediction as would be understood by a person skilled in the art and is described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 
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Differentially expressed genes were also identified as being differentially expressed 
with a p value of < 0.05 as between patients with mild osteoarthritis and normal individuals. 
The identity of the differentially expressed genes is shown in Tables 4C and 4D. 
Differentially expressed genes were also identified as being differentially expressed with a p 

5 value of < 0.05 as between patients with moderate osteoarthritis and normal individuals. 
Classification or class prediction of a test sample of an individual to determine whether said 
individual has mild osteoarthritis can be done using the differentially expressed genes as 
shown in Table 4C and/or 4D as the predictor genes in combination with well known 
statistical algorithms as would be understood by a person skilled in the art and described 

LO herein. Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Predication are also 

Differentially expressed genes were also identified as being differentially expressed 
with a p value of < 0.05 as between patients with moderate osteoarthritis and normal 

15 individuals. The identity of the differentially expressed genes is shown in Tables 4E and 4F. 
Classification or class prediction of a test sample of an individual to determine whether said 
individual has moderate osteoarthritis can be done using the differentially expressed genes as 
shown in Table 4E and/or 4F as the predictor genes in combination with well known 
statistical algorithms as would be understood by a person skilled in the art and described 

20 herein. Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also 

Differentially expressed genes were also identified as being differentially expressed 
with a p value of < 0.05 as between patients with marked osteoarthritis and normal 
individuals. The identity of the differentially expressed genes is shown in Tables 4G and 4H. 

25 Classification or class prediction of a test sample of an individual to determine 

whether said individual has marked osteoarthritis can be done using the differentially 
expressed genes as shown in Table 4G and/or 4H as the predictor genes in combination with 
well known statistical algorithms as would be understood by a person skilled in the art and 
described herein. Commercially available programs such as those provided by Silicon 

30 Genetics (e.g. GeneSpring™) for Class Prediction are also available. 
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Differentially expressed genes were also identified as being differentially expressed 
with a p value of < 0.05 as between patients with severe osteoarthritis and patients without 
osteoarthritis. The identity of the differentially expressed genes is shown in Tables 41 and 4 J. 

Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has severe osteoarthritis can be done using the differentially 
expressed genes as shown in Table 41 and/or 4J as the predictor genes in combination with 
well known statistical algorithms as would be understood by a person skilled in the art and 
described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 

Differentially expressed genes were also identified as being differentially expressed 
with a p value of < 0.05 as between patients with mild osteoarthritis and patients with 
moderate osteoarthritis. The identity of the differentially expressed genes is shown in Tables 
4K and 4L. Classification or class prediction of a test sample of an individual to differentiate 
as to whether said individual has mild or moderate osteoarthritis can be done using the 
differentially expressed genes as shown in Table 4K and/or 4L as the predictor genes in 
combination with well known statistical algorithms as would be understood by a person 
skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring™) for Class Prediction are also available. 

Differentially expressed genes were also identified as being differentially expressed 
with a p value of < 0.05 as between patients with mild osteoarthritis and patients with marked 
osteoarthritis. The identity of the differentially expressed genes is shown in Tables 4M and 
4N. 

Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has mild or marked osteoarthritis can be done using the differentially 
expressed genes as shown in Table 4M and/or 4N as the predictor genes in combination with 
well known statistical algorithms as would be understood by a person skilled in the art and 
described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 
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Differentially expressed genes were also identified as being differentially expressed 
with a p value of < 0.05 as between patients with mild osteoarthritis and patients with 
severeosteoarthritis. The identity of the differentially expressed genes is shown in Tables 40 
5 and 4P. 

Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has mild or severe osteoarthritis can be done using the differentially 
expressed genes as shown in Table 40 and/or 4P as the predictor genes in combination with 
well known statistical algorithms as would be understood by a person skilled in the art and 
10 described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 

Differentially expressed genes were also identified as being differentially expressed 
with a p value of < 0.05 as between patients with moderate osteoarthritis and patients with 
15 marked osteoarthritis. The identity of the differentially expressed genes is shown in Tables 
4Q and 4R. 

Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has moderate or marked osteoarthritis can be done using the 
differentially expressed genes as shown in Table 4Q and/or 4R as the predictor genes in 
20 combination with well known statistical algorithms as would be understood by a person 
skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring™) for Class Prediction are also available. 

Differentially expressed genes were also identified as being differentially expressed 
with a p value of < 0.05 as between patients with moderate osteoarthritis and patients with 
25 severe osteoarthritis. The identity of the differentially expressed genes is shown in Tables 4S 
and 4T. 

Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has moderate or severe osteoarthritis can be done using the 
differentially expressed genes as shown in Table 4S and/or 4T as the predictor genes in 
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combination with well known statistical algorithms as would be understood by a person 
skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring™) for Class Prediction are also available. 



Differentially expressed genes were also identified as being differentially expressed 
with a p value of < 0.05 as between patients with marked osteoarthritis and patients with 
severe osteoarthritis. The identity of the differentially expressed genes is shown in Tables 4U 
and 4V. 

Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has marked or severe osteoarthritis can be done using the 
differentially expressed genes as shown in Table 4U and/or 4V as the predictor genes in 
combination with well known statistical algorithms as would be understood by a person 
skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring™) for Class Prediction are also available. 

EXAMPLE 5 

In addition to methods to identify biomarkers associated with a specific disease or 
condition, or stage thereof, this invention also includes methods to identify biomarkers that 
distinguish between two conditions. The pair of conditions can be closely related, can have 
unrelated etiology but display similar overt symptoms, or can be unrelated. The following 
examples illustrate embodiments of methods of this aspect of the invention, but the invention 
is not limited to these embodiments. 

Manic Depression Syndrome as Compared with Schizophrenia RNA expression 
profiles 

. This example demonstrates the use of the claimed invention to identify biomarker 
which are capable of differentiating between manic depression syndrome and schizophrenia 
and use of same. 
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Whole blood samples were taken from patients diagnosed with MDS and 
Whole blood samples were taken from patients diagnosed with schizophrenia as defined 
herein. RNA expression profiles were then analyzed and the profiles generated for 
individuals having MDS compared with the profiles generated for individuals having 
schizophrenia. In each case, the diagnosis of MDS and schizophrenia is corroborated by a 
skilled Board certified physician. RNA expression profiles of Whole blood samples from 
individuals who were identified as having MDS as described herein as compared with RNA 
expression profiles from individuals identified as having schizophrenia were generated using 
GeneSpring™ software analysis as described herein. Hybridizations to create said RNA 
expression profiles were done using the Affymetrix® GeneChip® platforms (U133A and 
U133 Plus 2.0) platforms (U133A and/or U133 Plus 2.0) as described herein (data not 
shown). Various experiments were performed as outlined above, and analyzed using either 
the Wilcox Mann Whitney rank sum test or other statistical tests as described herein, and 

those genes identified with a p value of < 0.05 as between the patients with MDS as 

i 

compared with patients schizophrenia are shown in Table 3A. 

Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has schizophrenia or MDS can be done using the differentially 
expressed genes as shown in Table 3 A as the predictor genes in combination with well 
known statistical algorithms as would be understood by a person skilled in the art and 
described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 



Hepatitis as compared with Liver Cancer RNA expression profiles 

This example demonstrates the use of the claimed invention to identify biomarker 
which are capable of differentiating between hepatitis B and liver cancer and use of same. 

Whole blood samples were taken from patients diagnosed with hepatitis B and Whole 
blood samples were taken from patients diagnosed with liver cancer as defined herein. RNA 
expression profiles from were then analyzed and the profiles generated for individuals having 
hepatitis B compared with the profiles generated for individuals having liver cancer. In each 
case, the diagnosis of hepatitis B or liver cancer is corroborated by a skilled Board certified 
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physician. RNA expression profiles of Whole blood samples from individuals who were 
identified as having hepatitis B as described herein as compared with RNA expression 
profiles from individuals identified as having schizophrenia were generated using 
GeneSpring™ software analysis as described herein. Hybridizations to create said RNA 
expression profiles were done using the Affymetrix® GeneChip® platforms (U133A and 
U133 Plus 2.0) platforms (U133A and/orU133 Plus 2.0) as described herein (data not 
shown). Various experiments were performed as outlined above, and analyzed using either 
the Wilcox Mann Whitney rank sum test or other statistical tests as described herein, and 
those genes identified with a p value of < 0.05 as between the patients with MDS as 
compared with patients schizophrenia are shown in Table 3B. 

Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has hepatitis or liver cancer can be done using the differentially 
expressed genes as shown in Table 3B as the predictor genes in combination with well known 
statistical algorithms as would be understood by a person skilled in the art and described 
herein. Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Prediction are also available. 

RNA expression profiles 

Bladder Cancer as Compared with Kidney Cancer RNA expression profiles 

This example demonstrates the use of the claimed invention to identify 
biomarker which are capable of differentiating between bladder cancer and kidney cancer and 
use of same. 

Whole blood samples were taken from patients diagnosed with bladder cancer 
and Whole blood samples were taken from patients diagnosed with kidney canceras defined 
herein. RNA expression profiles were then analyzed and the profiles generated. In each 
case, the diagnosis of bladder cancer and kidney cancer was corroborated by a skilled Board 
certified physician. RNA expression profiles of Whole blood samples from individuals who 
were identified as having bladder cancer as described herein as compared with RNA 
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expression profiles from individuals identified as having kidney cancer were generated using 
GeneSpring™ software analysis as described herein. Hybridizations to create said RNA 
expression profiles were done using the Affymetrix® GeneChip® platforms (U133A and 
U133 Plus 2.0) platforms (U133A and/or U133 Plus 2.0) as described herein (data not 
5 shown). Various experiments were performed as outlined above, and analyzed using either 
the Wilcox Mann Whitney rank sum test or other statistical tests as described herein, and 
those genes identified with a p value of < 0.05 as between the patients with bladder cancer as 
compared with patients with kidney cancer are shown in Table 3C. 

Classification or class prediction of a test sample of an individual to differentiate as to 
10 whether said individual has bladder cancer or kidney cancer can be done using the 

differentially expressed genes as shown in Table 3C as the predictor genes in combination 
with well known statistical algorithms as would be understood by a person skilled in the art 
and described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 

1 5 RNA expression profiles 

Bladder Cancer as compared with Testicular CancerRNA Expression profiles 

This example demonstrates the use of the claimed invention to identify biomarker 
which are capable of differentiating between bladder cancer and testicular cancer and use of 
20 same. 

Whole blood samples were taken from patients diagnosed with bladder cancer 
and Whole blood samples were taken from patients diagnosed with testicular cancer as 
defined herein. RNA expression profiles were then analyzed and the profiles, generated for 
individuals having bladder cancer as compared with the profiles generated for individuals 

25 having testicular cancer. In each case, the diagnosis of bladder cancer and testicular cancer is 
corroborated by a skilled Board certified physician. RNA expression profiles of Whole 
blood samples from individuals who were identified as having bladder cancer as described 
herein as compared with RNA expression profiles from individuals identified as having 
testicular cancer were generated using GeneSpring™ software analysis as described herein. 

30 Hybridizations to create said RNA expression profiles were done using the Affymetrix® 

GeneChip® platforms (U133A and U133 Plus 2.0) platforms (U133A and/or U133 Plus 2.0) 
as described herein (data not shown). Various experiments were performed as outlined 
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above, and analyzed using either the Wilcox Mann Whitney rank sum test or other statistical 
tests as described herein, and those genes identified with a p value of < 0.05 as between the 
patients with bladder cancer as compared with patients testicular cancer are shown in Table 
3D. 

5 Classification or class prediction of a test sample of an individual to differentiate as to 

whether said individual has bladder cancer or testicular cancer can be done using the 
differentially expressed genes as shown in Table 3D as the predictor genes in combination 
with well known statistical algorithms as would be understood by a person skilled in the art 
and described herein. Commercially available programs such as those provided by Silicon 
10 Genetics (e.g. GeneSpring™) for Class Prediction are also available. 



RNA expression profilesKidney Cancer as compared with Testicular CancerRNA 

Expression Profiles This example demonstrates the use of the claimed invention to identify 
15 biomarker which are capable of differentiating between kidney cancer and testicular cancer 
and use of same. 

Whole blood samples were taken from patients diagnosed with kidney cancer 
and Whole blood samples were taken from patients diagnosed with testicular cancer as 
defined herein. RNA expression profiles were then analyzed and the profiles generated for 

20 individuals having kidney cancer as compared with the profiles generated for individuals 

having testicular cancer. In each case, the diagnosis of kidney cancer and testicular cancer is 
corroborated by a skilled Board certified physician. RNA expression profiles of Whole 
blood samples from individuals who were identified as having kidney cancer as described 
herein as compared with RNA expression profiles from individuals identified as having 

25 testicular cancer were generated using GeneSpring™ software analysis as described herein. 
Hybridizations to create said RNA expression profiles were done using the Affymetrix® 
GeneChip® platforms (U133A and U133 Plus 2.0) platforms (U133A and/or U133 Plus 2.0) 
as described herein (data not shown). Various experiments were performed as outlined 
above, and analyzed using either the Wilcox Mann Whitney rank sum test or other statistical 

30 tests as described herein, and those genes identified with a p value of < 0.05 as between the 
patients with bladder cancer as compared with patients with testicular cancer are shown in 
Table 3E. 
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Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has bladder cancer or testicular cancer can be done using the 
differentially expressed genes as shown in Table 3E as the predictor genes in combination 
with well known statistical algorithms as would be understood by a person skilled in the art 
5 and described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 

RNA expression profiles 



10 Liver Cancer as compared with Stomach Cancer RNA expression profiles 

This example demonstrates the use of the claimed invention to identify biomarker 
which are capable of differentiating between liver cancer and stomach cancer and use of 
same. 

Whole blood samples were taken from patients diagnosed with liver cancer 
15 and Whole blood samples were taken from patients diagnosed with stomach cancer as 

defined herein. RNA expression profiles were then analyzed and the profiles generated for 
individuals having liver cancer as compared with the profiles generated for individuals 
having stomach cancer. In each case, the diagnosis of liver cancer and stomach cancer is 
corroborated by a skilled Board certified physician. RNA expression profiles of Whole 
20 blood samples from individuals who were identified as having liver cancer as described 
herein as compared with RNA expression profiles from individuals identified as having 
stomach cancer were generated using GeneSpring™ software analysis as described herein. 
Hybridizations to create said RNA expression profiles were done using the Affymetrix® 
GeneChip® platforms (U133A and/or U133 Plus 2.0) as described herein (data not shown). 
25 Various experiments were performed as outlined above, and analyzed using either the Wilcox 
Mann Whitney rank sum test or other statistical tests as described herein, and those genes 
identified with a p value of < 0.05 as between the patients with bladder cancer as compared 
with patients testicular cancer are shown in Table 3F. 

Classification or class prediction of a test sample of an individual to differentiate as to 
30 whether said individual has bladder cancer or testicular cancer can be done using the 

differentially expressed genes as shown in Table 3F as the predictor genes in combination 
with well known statistical algorithms as would be understood by a person skilled in the art 
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and described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 



RNA expression profiles 

5 Liver Cancer as compared with Colon Cancer 

This example demonstrates the use of the claimed invention to identify 
biomarker which are capable of differentiating between liver cancer and colon cancer and use 
of same. 

Whole blood samples were taken from patients diagnosed with liver cancer 
10 and Whole blood samples were taken from patients diagnosed with colon cancer as defined 
herein. RNA expression profiles were then analyzed and the profiles generated for 
individuals having liver cancer as compared with the profiles generated for individuals 
having colon cancer. In each case, the diagnosis of liver cancer and colon cancer is 
corroborated by a skilled Board certified physician. RNA expression profiles of Whole 
15 blood samples from individuals who were identified as having liver cancer as described 

herein as compared with RNA expression profiles from individuals identified as having colon 
cancer were generated using GeneSpring™ software analysis as described herein. 
Hybridizations to create said RNA expression profiles were done using the Affymetrix® 
GeneChip® platforms (U133A and/or U133 Plus 2.0) as described herein (data not shown). 
20 Various experiments were performed as outlined above, and analyzed using either the Wilcox 
Mann Whitney rank sum test or other statistical tests as described herein, and those genes 
identified with a p value of < 0.05 as between the patients with liver cancer as compared with 
patients with colon cancer are shown in Table 3G. 

Classification or class prediction of a test sample of an individual to differentiate as to 
25 whether said individual has liver cancer or colon cancer can be done using the differentially 
expressed genes as shown in Table 3G as the predictor genes in combination with well 
known statistical algorithms as would be understood by a person skilled in the art and 
described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 

30 
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3G. 

Stomach Cancer as compared with Colon Cancer 

This example demonstrates the use of the claimed invention to identify 
biomarkers which are capable of differentiating between stomach cancer and colon cancer 
5 and use of same. 

Whole blood samples were taken from patients diagnosed with stomach cancer 
and Whole blood samples were taken from patients diagnosed with colon cancer as defined 
herein. RNA expression profiles were then analyzed and the profiles generated for 
individuals having stomach cancer as compared with the profiles generated for individuals 

10 having colon cancer. In each case, the diagnosis of stomach cancer and colon cancer is 
corroborated by a skilled Board certified physician. RNA expression profiles of Whole 
blood samples from individuals who were identified as having stomach cancer as described 
herein as compared with RNA expression profiles from individuals identified as having colon 
cancer were generated using GeneSpring™ software analysis as described herein. 

15 Hybridizations to create said RNA expression profiles were done using the Affymetrix® 
GeneChip® platforms (U133A and/or U133 Plus 2.0) as described herein (data not shown). 
Various experiments were performed as outlined above, and analyzed using either the Wilcox 
Mann Whitney rank sum test or other statistical tests as described herein, and those genes 
identified with a p value of < 0.05 as between the patients with stomach cancer as compared 

20 with patients colon cancer are shown in Table 3H. 

Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has stomach cancer or colon cancer can be done using the 
differentially expressed genes as shown in Table 3H as the predictor genes in combination 
with well known statistical algorithms as would be understood by a person skilled in the art 
25 and described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 



Osteoarthritis as compared with Rheumatoid Arthritis 

30 This example demonstrates the use of the claimed invention to identify 

biomarkers which are capable of differentiating between OA and RA and use of same. 
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Whole blood samples were taken from patients diagnosed with OA and Whole 
blood samples were taken from patients diagnosed with RA as defined herein. RNA 
expression profiles were then analyzed and the profiles generated for individuals having OA 
as compared with the profiles generated for individuals having RA. In each case, the 
5 diagnosis of OA and RA is corroborated by a skilled Board certified physician. RNA 

expression profiles of Whole blood samples from individuals who were identified as having 
OA as described herein as compared with RNA expression profiles from individuals 
identified as having RA were generated using GeneSpring™ software analysis as described 
herein. Hybridizations to create said RNA expression profiles were done using the 
10 Affymetrix® GeneChip® platforms (U133A and/or U133 Plus 2.0) as described herein (data 
not shown). Various experiments were performed as outlined above, and analyzed using 
either the Wilcox Mann Whitney rank sum test or other statistical tests as described herein, 
and those genes identified with a p value of < 0.05 as between the patients with OA as 
compared with patients with RA are shown in Table 31. 

15 Classification or class prediction of a test sample of an individual to differentiate as to 

whether said individual has OA or RA can be done using the differentially expressed genes as 
shown in Table 31 as the predictor genes in combination with well known statistical 
algorithms as would be understood by a person skilled in the art and described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 

20 GeneSpring™) for Class Prediction are also available. 



Chagas Disease as compared with Heart Failure This example demonstrates the 
use of the claimed invention to identify biomarkers which are capable of differentiating 
between Chagas' disease and heart failure and use of same. 

25 Whole blood samples were taken from patients diagnosed with Chagas' 

disease and Whole blood samples were taken from patients diagnosed with heart failure as 
defined herein. RNA expression profiles were then analyzed and the profiles generated for 
individuals having Chagas' disease as compared with the profiles generated for individuals 
having heart failure. In each case, the diagnosis of Chagas' disease and heart failure is 

30 corroborated by a skilled Board certified physician. RNA expression profiles of Whole 
blood samples from individuals who were identified as having Chagas' disease as described 
herein as compared with RNA expression profiles from individuals identified as having heart 
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failure were generated using GeneSpring™ software analysis as described herein. 
Hybridizations to create said RNA expression profiles were done using the Affymetrix® 
GeneChip® platforms (U133A and/or U133 Plus 2.0) as described herein (data not shown). 
Various experiments were performed as outlined above, and analyzed using either the Wilcox 
5 Mann Whitney rank sum test or other statistical tests as described herein, and those genes 
identified with a p value of < 0.05 as between the patients with Chagas' disease as compared 
with patients with heart failure are shown in Table 31. 

Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has Chagas' disease or heart failure can be done using the 
10 differentially expressed genes as shown in Table 31 as the predictor genes in combination 
with well known statistical algorithms as would be understood by a person skilled in the art 
and described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 

RNA expression profiles 

15 Chagas Disease as compared with Coronary Artery Disease This example 

demonstrates the use of the claimed invention to identify biomarkers which are capable of 
differentiating between Chagas' disease and coronary artery disease and use of same. 

Whole blood samples were taken from patients diagnosed with Chagas' 
disease and Whole blood samples were taken from patients diagnosed with coronary artery 
20 disease as defined herein. RNA expression profiles were then analyzed and the profiles 

generated for individuals having stomach cancer as compared with the profiles generated for 
individuals having coronary artery disease. In each case, the diagnosis of Chagas' disease 
and coronary artery disease is corroborated by a skilled Board certified physician. RNA 

expression profiles of Whole blood samples from individuals who were identified as having 

i 

25 Chagas' disease as described herein as compared with RNA expression profiles from 

individuals identified as having coronary artery disease were generated using GeneSpring™ 
software analysis as described herein. Hybridizations to create said RNA expression profiles 
were done using the Affymetrix® GeneChip® platforms (U133A and/or U133 Plus 2.0) as 
described herein (data not shown). Various experiments were performed as outlined above, 

30 and analyzed using either the Wilcox Mann Whitney rank sum test or other statistical tests as 
described herein, and those genes identified with a p value of < 0.05 as between the patients 
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with Chagas' disease as compared with patients coronary artery disease are shown in Table 
3L. 

Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has Chagas 5 disease or coronary artery disease can be done using the 
differentially expressed genes as shown in Table 3L as the predictor genes in combination 
with well known statistical algorithms as would be understood by a person skilled in the art 
and described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 

RNA expression profiles 

RNA expression profilesRNA expression profilesRNA expression profilesRNA 
expression profiles. 

Coronary Artery Disease (CAD) as compared with Heart Failure 

This example demonstrates the use of the claimed invention to identify biomarkers 
15 which are capable of differentiating between Coronary Artery Disease (CAD) and Heart 
Failure and use of same. 

Whole blood samples were taken from patients diagnosed with having Coronary 
Artery Disease (CAD) and Whole blood samples were taken from patients diagnosed with 
having Heart Failure as defined herein. RNA expression profiles were then analyzed and the 

20 profiles generated for individuals having CAD as compared with the profiles generated for 
individuals heart failure. In each case, the diagnosis of heart failure and coronary artery 
disease is corroborated by a skilled Board certified physician. RNA expression profiles of 
Whole blood samples from individuals who were identified as having coronary artery disease 
as described herein as compared with RNA expression profiles from individuals identified as 

25 having heart failure were generated using GeneSpring™ software analysis as described 
herein. Hybridizations to create said RNA expression profiles were done using the 
Affymetrix® GeneChip® platforms (U133A and/or U133 Plus 2.0) as described herein (data 
not shown). Various experiments were performed as outlined above, and analyzed using 
either the Wilcox Mann Whitney rank sum test or other statistical tests as described herein, 

30 and those genes identified with a p value of < 0.05 as between the patients with coronary 
artery disease as compared with patients with heart failure are shown in Table 3N. 
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Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has coronary artery disease or heart failure can be done using the 
differentially expressed genes as shown in Table 3N as the predictor genes in combination 
with well known statistical algorithms as would be understood by a person skilled in the art 
and described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 

RNA expression profiles 
Asymptomatic Chagas Disease as compared with Symptomatic Chagas Disease 

This example demonstrates the use of the claimed invention to identify biomarkers 
which are capable of differentiating between Asymptomatic Chagas Disease and 
Symptomatic Chagas Disease and use of same. 

Whole blood samples were taken from patients diagnosed with having Asymptomatic 
Chagas Disease and Whole blood samples were taken from patients diagnosed with 
Symptomatic Chagas Disease as defined herein. RNA expression profiles were then 
analyzed and the profiles generated for individuals having Asymptomatic Chagas Disease as 
compared with the profiles generated for individuals with Symptomatic Chagas Disease. In 
each case, the diagnosis of Asymptomatic Chagas Disease and and Symptomatic Chagas 
Disease is corroborated by a skilled Board certified physician. RNA expression profiles of 
Whole blood samples from individuals who were identified as having Asymptomatic Chagas 
Disease as described herein as compared with RNA expression profiles from individuals 
identified as having Symptomatic Chagas Disease were generated using GeneSpring™ 
software analysis as described herein. Hybridizations to create said RNA expression profiles 
were done using the Affymetrix® GeneChip® platforms (U133A and/or U133 Plus 2.0) as 
described herein (data not shown). Various experiments were performed as outlined above, 
and analyzed using either the Wilcox Mann Whitney rank sum test or other statistical tests as 
described herein, and those genes identified with a p value of < 0.05 as between the patients 
with Asymptomatic Chagas Disease as compared with patients with Symptomatic Chagas 
Disease are shown in Table 3P. 

Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has Asymptomatic Chagas Disease or Symptomatic Chagas Disease 
can be done using the differentially expressed genes as shown in Table 3P as the predictor 
genes in combination with well known statistical algorithms as would be understood by a 
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person skilled in the art and described herein. Commercially available programs such as 
those provided by Silicon Genetics (e.g. GeneSpring™) for Class Prediction are also 
available. 



Alzheimer's Disease as compared with Schizophrenia 

This example demonstrates the use of the claimed invention to identify biomarkers 
which are capable of differentiating between Alzheimer's Disease and Schizophrenia and use 
of same. 

10 Whole blood samples were taken from patients diagnosed with having Alzheimer's 

Disease Disease and Whole blood samples were taken from patients diagnosed with 
Schizophrenia as defined herein. RNA expression profiles were then analyzed and the 
profiles generated for individuals having Alzheimer's Disease as compared with the profiles 
generated for individuals with Schizophrenia . In each case, the diagnosis of Alzheimer's 

15 Disease and Schizophrenia is corroborated by a skilled Board certified physician. RNA 
expression profiles of Whole blood samples from individuals who were identified as having 
Alzheimer's Disease as described herein as compared with RNA expression profiles from 
individuals identified as Schizophrenia were generated using GeneSpring™ software analysis 
as described herein. Hybridizations to create said RNA expression profiles were done using 

20 the Affymetrix® GeneChip® platforms (U133A and/or U133 Plus 2.0) as described herein 
(data not shown). Various experiments were performed as outlined above, and analyzed 
using either the Wilcox Mann Whitney rank sum test or other statistical tests as described 
herein, and those genes identified with a p value of < 0.05 as between the patients with 
Alzheimer's Disease as compared with patients Schizophrenia are shown in Table 3Q. 

25 Classification or class prediction of a test sample of an individual to differentiate as to 

whether said individual has Alzheimer's Disease or Schizophrenia can be done using the 
differentially expressed genes as shown in Table 3Q as the predictor genes in combination 
with well known statistical algorithms as would be understood by a person skilled in the art 
and described herein. Commercially available programs such as those provided by Silicon 

30 Genetics (e.g. GeneSpring™) for Class Prediction are also available. 
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Alzheimer's Disease as compared with Manic Depression 

This example demonstrates the use of the claimed invention to identify biomarkers 
5 which are capable of differentiating between Alzheimer's Disease and Manic Depression and 
use of same. 

Whole blood samples were taken from patients diagnosed with having Alzheimer's 
Disease Disease and Whole blood samples were taken from patients diagnosed with Manic 
Depression as defined herein. RNA expression profiles were then analyzed and the profiles 

10 generated for individuals having Alzheimer's Disease as compared with the profiles 

generated for individuals with Manic Depression. In each case, the diagnosis of Alzheimer's 
Disease and Manic Depression is corroborated by a skilled Board certified physician. RNA 
expression profiles of Whole blood samples from individuals who were identified as having 
^Alzheimer's Disease as described herein as compared with RNA expression profiles from 

15 individuals identified as Manic Depression were generated using GeneSpring™ software 
analysis as described herein. Hybridizations to create said RNA expression profiles were 
done using the Affymetrix® GeneChip® platforms (U133A and/or U133 Plus 2.0) as 
described herein (data not shown). Various experiments were performed as outlined above, 
and analyzed using either the Wilcox Mann Whitney rank sum test or other statistical tests as 

20 described herein, and those genes identified with a p value of < 0.05 as between the patients 
with Alzheimer's Disease as compared with patients Manic Depression are shown in Table 
3R. 

Classification or class prediction of a test sample of an individual to differentiate as to 
whether said individual has Alzheimer's Disease or Manic Depression can be done using the 
25 differentially expressed genes as shown in Table 3R as the predictor genes in combination 
with well known statistical algorithms as would be understood by a person skilled in the art 
and described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Prediction are also available. 

30 RNA expression profiles 
EXAMPLE 5 
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In addition to methods to identify markers that distinguish between two diseases or 
conditions, this invention also includes methods to identify biomarkers specific for a group of 
three or more related diseases or conditions. The following three examples present methods 
to identify biomarkers for the following groups of diseases or conditions: cancer, 
5 cardiovascular disease and neurological disease, and the identified markers thereof. However 
the invention is not limited to these three groups of diseases or conditions. 



Cancer 

This example demonstrates the use of the claimed invention to identify biomarkers of 
10 cancer and use of same. 

As used herein "Cancer" is defined as any of the various types of malignant 
neoplasms, most of which invade surrounding tissues, may metastasize to several sites, and 
are likely to recur after attempted removal and to cause death of the patient unless adequately 
treated; especially, any such carcinoma or sarcoma, but, in ordinary usage, especially the 

15 former. In each case, the diagnosis of Cancer was corroborated by a skilled Board certified 
physician. RNA expression profilesRNA expression profiles of Whole blood samples from 
individuals who were identified as having cancer as described herein as compared with RNA 
expression profiles from individuals not having cancer, were generated using GeneSpring™ 
software analysis as described herein. Hybridizations to create said RNA expression profiles 

20 were done using the Affymetrix® GeneChip® platforms (U133A and U133 Plus 2.0) 
platforms as described herein (data not shown). Various experiments were performed as 
outlined above, and analyzed using either the Wilcox Mann Whitney rank sum test or other 
statistical tests as described herein, and those genes identified with a p value of < 0.05 as 
between the patients with cancer as compared with patients without cancer are shown in 

25 Table 6A. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with cancer can be done using the differentially expressed genes as 
shown in Table 6 A in combination with well known statistical algorithms for class prediction 
as would be understood by a person skilled in the art and is described herein. Commercially 
30 available programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class 
Prediction are also available. 
Cardiovascular Disease 
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This example demonstrates the use of the claimed invention to identify biomarkers of 
cardiovascular disease and use of same. 

As used herein in this example "Cardiovascular Disease" is defined as a disease 
affecting the heart or blood vessels. Cardiovascular diseases include coronary artery disease, 
hearart failure, and hypertension. In each case, the diagnosis of Cardiovascular Disease was 
corroborated by a skilled Board certified physician. RNA expression profiles of Whole blood 
samples from individuals who were identified as having Cardiovascular Disease as described 
herein as compared with RNA expression profiles from individuals not having 
Cardiovascular Disease, were generated using GeneSpring™ software analysis as described 
herein. Hybridizations to create said RNA expression profiles were done using the 
Affymetrix® GeneChip® platforms (U133A and U133 Plus 2.0) platforms as described 
herein (data not shown). Various experiments were performed as outlined above, and 
analyzed using either the Wilcox Mann Whitney rank sum test or other statistical tests as 
described herein, and those genes identified with a p value of < 0.05 as between the patients 
with Cardiovascular Disease as compared with patients without Cardiovascular Disease are 
shown in Table 6B. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with Cardiovascular Disease can be done using the differentially 
expressed genes as shown in Table 6B in combination with well known statistical algorithms 
for class prediction as would be understood by a person skilled in the art and is described 
herein 



Neurological Diseases 

This example demonstrates the use of the claimed invention to identify biomarkers of 
Neurological Disease and use of same. 

As used herein "Neurological Disease" is defined as a disorder of the nervous 
system, and include disorders that involve the central nervous system (brain, brainstem and 
cerebellum), the peripheral nervous system (including cranial nerves), and the autonomic 
nervous system (parts of which are located in both central and peripheral nervous system). In 
particular neurological disease includes alzheimers', schizophrenia, and manic depression 
syndrome. In each case, the diagnosis of Neurological Disease was corroborated by a 
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skilled Board certified physician. Hybridizations to create said RNA expression profiles 
were done using the Asymetrix® GeneChip® platforms (U133A and U133 Plus 2.0) 
platforms as described herein (data not shown). Various experiments were performed as 
outlined above, and analyzed using either the Wilcox Mann Whitney rank sum test or other 
5 statistical tests as described herein, and those genes identified with a p value of < 0.05 as 
between the patients with neurological Disease as compared with patients without 
neurological Disease are shown in Table 6C. 

Classification or class prediction of a test sample from an unknown patient in order to 
diagnose said individual with Neurological Disease can be done using the differentially 
10 expressed genes as shown in Table 6C in combination with well known statistical algorithms 
for class prediction as would be understood by a person skilled in the art and is described 
herein 



15 EXAMPLE 6 

In addition to methods to identify biomarkers that are associated with a specific group 
of diseases or conditions, another aspect of this invention includes methods to identify 
biomarkers that are associated with the administration of a specific drug or exogenous 
substance, or a specific grouping of drugs or exogenous substances thereof. In essence this 

20 aspect of the invention provides a method of providing an individuals drug signature. The 
administration of the exogenous substance(s) or drug(s) can be via any route and the instant 
methods of identifying these markers can be applied at any specifies time point(s) after said 
administration. The following examples illustrate embodiments of this drug signature aspect 
of the invention, but the invention is not limited to the methods comprising the drug(s) and 

25 exogenous substance(s), or groups of drugs amd exogenous substances illustrated below. 

Celebrex 1 * 

Celebrex versus other COX inhibitors^ 

This example demonstrates the use of the claimed invention to identify biomarkers 
30 associated with Celebrex R and use of same. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from individuals who have been adminstered with 
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Celebrex R as compared to Whole blood samples taken from individuals who have been 
adminstered with any Cox inhibitor except Celebrex R . 

As used herein "Cox Inhibitor" is defined as anti-inflammatory drug that covalently 
modifies cyclooxygenases (Cox). RNA expression profiles from individuals who have been 

5 adminstered with Celebrex R were analyzed and compared to profiles from individuals who 
have been adminstered with any Cox inhibitor except Celebrex R . Preferably healthy 
individuals are chosen who are age and sex matched to said individuals being compared. 
Total mRNA from a blood sample is taken from each individual and isolated using TRIzol® 
reagent (GEBCO) and fluorescently labelled probes for each blood sample is generated as 

10 described above Hybridizations to create said RNA expression profiles were done using the 
Affymetrix® GeneChip® platforms (U133A and U133 Plus 2.0) platforms as described 
herein (data not shown). Various experiments were performed as outlined above, and 
analyzed using either the Wilcox Mann Whitney rank sum test or other statistical tests as 
described herein. Identification of genes differentially expressed in Whole blood samples 

15 from individuals who have been adminstered with Celebrex R as compared to individuals 
who have been adminstered with any Cox inhibitor except Celebrex R is determined by 
statistical analysis using the Wilcox Mann Whitney rank sum test using either the Wilcox 
Mann Whitney rank sum test or other statistical tests as described herein. Those differentially 
expressed genes identified with a p value of < 0.05 as between the individuals who have 

20 been adminstered with Celebrex R as compared to individuals who have been adminstered 
with any Cox inhibitor except Celebrex 1 ** are shown in Table 7A. 

Celebrex versus no Celebrex: 

25 This example demonstrates the use of the claimed invention to detect differential gene 

expression in Whole blood samples taken from individuals who have been adminstered with 
Celebrex R as compared to Whole blood samples taken from individuals who have been not 
been adminstered with Celebrex R . RNA expression profiles from individuals who have been 
adminstered with Celebrex R were analyzed and compared to profiles from individuals who 

30 have not been adminstered with Celebrex R . Preferably healthy individuals are chosen who 
are age and sex matched to said individuals being compared. Total mRNA from a blood 
sample is taken from each individual and isolated using TRIzol® reagent (GIBCO) and 
fluorescently labelled probes for each blood sample is generated as described above 
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Hybridizations to create said RNA expression profiles were done using the Affymetrix® 
GeneChip® platforms (U133A and U133 Plus 2.0) platforms as described herein (data not 
shown). Various experiments were performed as outlined above, and analyzed using either 
the Wilcox Mann Whitney rank sum test or other statistical tests as described herein. 

5 Identification of genes differentially expressed in Whole blood samples from individuals 
who have been adminstered with Celebrex R as compared to individuals who have been not 
been adminstered with Celebrex R is determined by statistical analysis using the Wilcox 
Mann Whitney rank sum test using either the Wilcox Mann Whitney rank sum test or other 
statistical tests as described herein. Those differentially expressed genes identified with ap 

10 value of < 0.05 as between the individuals who have been adminstered with Celebrex R as 
compared to individuals who have not been adminstered with Celebrex R ' are shown in Table 
7B. 

Vioxx R 

15 Vioxx R versus no Vioxx^i 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from individuals who have been adminstered with 
Vioxx R as compared to Whole blood samples taken from individuals who have been not 
been adminstered withVioxx R RNA expression profiles from individuals who have been 

20 adminstered with Vioxx R were analyzed and compared to profiles from individuals who have 
not been adminstered with Vioxx R . Preferably healthy individuals are chosen who are age 
and sex matched to said individuals being compared. Total mRNA from a blood sample is 
taken from each individual and isolated using TRIzol® reagent (GIBCO) and fluorescently 
labelled probes for each blood sample is generated as described above. Hybridizations to 

25 create said RNA expression profiles were done using the Affymetrix® GeneChip® platforms 
(U133AandU133 Plus 2.0) platforms as described herein (data not shown). Various 
experiments were performed as outlined above, and analyzed using either the Wilcox Mann 
Whitney rank sum test or other statistical tests as described herein. Identification of genes 
differentially expressed in Whole blood samples from individuals who have been 

30 adminstered with Vioxx R as compared to individuals who have been not been adminstered 
with Vioxx R is determined by statistical analysis using the Wilcox Mann Whitney rank sum 
test using either the Wilcox Mann Whitney rank sum test or other statistical tests as described 
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herein. Those differentially expressed genes identified with a p value of < 0.05 as between 
the individuals who have been adminstered with Vioxx R as compared to individuals who 
have not been adminstered with Vioxx R are shown in Table 7C. 



5 Vioxx R versus other COX inhibitors 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from individuals who have been adminstered with 
Vioxx R as compared to Whole blood samples taken from individuals who have been 
adminstered with any Cox inhibitor except Vioxx R 

10 RNA expression profiles from individuals who have been adminstered with Vioxx 

were analyzed and compared to profiles from individuals who have been adminstered with 
any Cox inhibitor except Vioxx R Preferably healthy individuals are chosen who are age and 
sex matched to said individuals being compared. Total mRNA from a blood sample is taken 
from each individual and isolated using TRIzol® reagent (GIBCO) and fluorescently labelled 

15 probes for each blood sample is generated as described above. Hybridizations to create said 
RNA expression profiles were done using the Affymetrix® GeneChip® platforms (U133A 
and U133 Plus 2.0) platforms as described herein (data not shown). Various experiments 
were performed as outlined above, and analyzed using either the Wilcox Mann Whitney rank 
sum test or other statistical tests as described herein.. Identification of genes differentially 

20 expressed in Whole blood samples from individuals who have been adminstered with 
Vioxx R as compared to individuals who have been adminstered with any Cox inhibitor 
except Vioxx R is determined by statistical analysis using the Wilcox Mann Whitney rank sum 
test using either the Wilcox Mann Whitney rank sum test or other statistical tests as described 
herein. Those differentially expressed genes identified with a p value of < 0.05 as between 

25 the individuals who have been adminstered with Vioxx R as compared to individuals who 
have been adminstered with any Cox inhibitor except Vioxx R are shown in Table 7D. 

Non-steroidal anti-inflammatory agents (NSAIDs) 

30 This example demonstrates the use of the claimed invention to detect differential gene 

expression in Whole blood samples taken from individuals who have been adminstered with 
non-steroidal anti-inflammatory agents as compared to Whole blood samples taken from 
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individuals who have been not been adminstered with non-steroidal anti-inflammatory 
agents. As defined herein, non-steroidal anti-inflammatory agents are defined as a large 
group of anti-inflammatory agents that work by inhibiting the production of prostaglandins. 
They exert anti-inflammatory, analgesic and antipyretic actions and include: ibuprofen, 
ketoprofen, piroxicam, naproxen, sulindac, aspirin, choline subsalicylate, diflunisal, 
fenoprofen, indomethacin, meclofenamate, salsalate, tolmetin and magnesium salicylate. Not 
icluded are steroidal compounds (such as hydrocortisone or prednisone) exerting anti- 
inflammatory activity. RNA expression profiles from individuals who have been adminstered 
with non-steroidal anti-inflammatory agents were analyzed and compared to profiles from 
individuals who have not been adminstered with non-steroidal anti-inflammatory agents. 
Preferably healthy individuals are chosen who are age and sex matched to said individuals 
being compared. Total mRNA from a blood sample is taken from each individual and 
isolated using TRIzol® reagent (GIBCO) and fluorescently labelled probes for each blood 
sample is generated as described above. Hybridizations to create said RNA expression 
profiles were done using the Affymetrix® GeneChip® platforms (U133A and U133 Plus 
2.0) platforms as described herein (data not shown). Various experiments were performed as 
outlined above, and analyzed using either the Wilcox Mann Whitney rank sum test or other 
statistical tests as described herein. Identification of genes differentially expressed in Whole 
blood samples from individuals who have been adminstered withnon-steroidal anti- 
inflammatory agents as compared to individuals who have been not been adminstered with 
non-steroidal anti-inflammatory agents is determined by statistical analysis using the Wilcox 
Mann Whitney rank sum test using either the Wilcox Mann Whitney rank sum test or other 
statistical tests as described herein. Those differentially expressed genes identified with a p 
value of < 0.05 as between the individuals who have been adminstered with non-steroidal 
anti-inflammatory agents as compared to individuals who have not been adminstered with 
non-steroidal anti-inflammatory agents are shown in Table 7E. 

Cortisone 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from individuals who have been adminstered with 
Cortisone as compared to Whole blood samples taken from individuals who have been not 
been adminstered with Cortisone. RNA expression profiles from individuals who have been 
adminstered with Cortisone were analyzed and compared to profiles from individuals who 
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have not been adminstered with Cortisone. Preferably healthy individuals are chosen who 
are age and sex matched to said individuals being compared. Total mRNA from a blood 
sample is taken from each individual and isolated using TRIzol® reagent (GIBCO) and 
fluorescently labelled probes for each blood sample is generated as described above. 
Hybridizations to create said RNA expression profiles were done using the Affymetrix® 
GeneChip® platforms (U133A and U133 Plus 2.0) platforms (U133A and U133 Plus 2.0) as 
described herein (data not shown). Various experiments were performed as outlined above, 
and analyzed using either the Wilcox Mann Whitney rank sum test or other statistical tests as 
described herein.. Identification of genes differentially expressed in Whole blood samples 
from individuals who have been adminstered with Cortisone as compared to individuals who 
have been not been adminstered with Cortisone is determined by statistical analysis using the 
Wilcox Mann Whitney rank sum test using either the Wilcox Mann Whitney rank sum test or 
other statistical tests as described herein. Those differentially expressed genes identified with 
a p value of < 0.05 as between the individuals who have been adminstered with Cortisone 
as compared to individuals who have not been adminstered with Cortisone are shown in 
Table 7F. 

Visco Supplement 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from individuals who have been adminstered with 
Visco Supplement as compared to Whole blood samples taken from individuals who have 
been not been adminstered with Visco Supplement. RNA expression profiles from 
individuals who have been adminstered with Visco Supplement were analyzed and compared 
to profiles from individuals who have not been adminstered with Visco Supplement. 
Preferably healthy individuals are chosen who are age and sex matched to said individuals 
being compared. Total mRNA from a blood sample is taken from each individual and 
isolated using TRIzol® reagent (GIBCO) and fluorescently labelled probes for each blood 
sample is generated as described above. Hybridizations to create said RNA expression 
profiles were done using the Affymetrix® GeneChip® platforms platforms (U133A and 
U133 Plus 2.0) as described herein (data not shown). Various experiments were performed 
as outlined above, and analyzed using either the Wilcox Mann Whitney rank sum test or other 
statistical tests as described herein. Identification of genes differentially expressed in Whole 
blood samples from individuals who have been adminstered with Visco Supplement as 
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compared to individuals who have been not been adminstered with Visco Supplement is 
determined by statistical analysis using the Wilcox Mann Whitney rank sum test using either 
the Wilcox Mann Whitney rank sum test or other statistical tests as described herein. Those 
differentially expressed genes identified with a p value of < 0.05 as between the individuals 
5 who have been adminstered with Visco Supplement as compared to individuals who have 
not been adminstered with Visco Supplement are shown in Table 7G. 

Lipitor 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from individuals who have been adminstered with 

L0 Lipitor as compared to Whole blood samples taken from individuals who have been not been 
adminstered with Lipitor. RNA expression profiles from individuals who have been 
adminstered with Lipitor were analyzed and compared to profiles from individuals who have 
not been adminstered with Lipitor. Preferably healthy individuals are chosen who are age 
and sex matched to said individuals being compared. Total mRNA from a blood sample is 

15 taken from each individual and isolated using TRIzol® reagent (GIBCO) and fluorescently 
labelled probes for each blood sample is generated as described above. Hybridizations to 
create said RNA expression profiles were done using the Affymetrix® GeneChip® platforms 
(U133A and U133 Plus 2.0) platforms as described herein (data not shown). Various 
experiments were performed as outlined above, and analyzed using either the Wilcox Mann 

20 Whitney rank sum test or other statistical tests as described herein. Identification of genes 
differentially expressed in Whole blood samples from individuals who have been 
adminstered with Lipitor as compared to individuals who have been not been adminstered 
with Lipitor is determined by statistical analysis using the Wilcox Mann Whitney rank sum 
test using either the Wilcox Mann Whitney rank sum test or other statistical tests as described 

25 herein. Those differentially expressed genes identified with a p value of < 0.05 as between 
the individuals who have been adminstered with Lipitor as compared to individuals who 
have not been adminstered with Lipitor are shown in Table 7H. 

Smoking 

This example demonstrates the use of the claimed invention to detect differential gene 
30 expression in Whole blood samples taken from individuals who have smoked cigarettes and 
cigars as compared to Whole blood samples taken from individuals who have not smoked 
cigarettes and cigars . RNA expression profiles from individuals who have smoked were 
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analyzed and compared to profiles from individuals who have not smoked. Preferably 
healthy individuals are chosen who are age and sex matched to said individuals being 
compared. Total tnRNA from a blood sample is taken from each individual and isolated 
using TRIzol® reagent (GIBCO) and fluorescently labelled probes for each blood sample is 
5 generated as described above. Hybridizations to create said RNA expression profiles were 
done using the Affymetrix® GeneChip® platforms (U133A and U133 Plus 2.0) platforms 
as described herein (data not shown). Various experiments were performed as outlined 
above, and analyzed using either the Wilcox Mann Whitney rank sum test or other statistical 
tests as described herein. Identification of genes differentially expressed in Whole blood 

10 samples from individuals who have smoked as compared to individuals who have not 

smoked is determined by statistical analysis using the Wilcox Mann Whitney rank sum test 
using either the Wilcox Mann Whitney rank sum test or other statistical tests as described 
herein. Those differentially expressed genes identified with a p value of < 0.05 as between 
the individuals who have smoked as compared to individuals who have not smoked are 

1 5 shown in Table 71. 

Example 7. 

Identification of Genes Specific for OA Only by Removing Genes Relevant to Co- 
Morbidities and Other Disease States. 

This example demonstrates the use of the claimed invention to detect differential gene 
20 expression in blood unique to Osteoarthritis as compared with other disease states. 

Whole blood samples were taken from patients who were diagnosed with mild OA or 
severe OA and compared with individuals who were identified as normal individuals as 
defined herein. RNA expression profiles were then analysed to identify genes which are 
differentially expressed in OA as compared with normal. In each case, the diagnosis of OA 
25 was corroborated by a qualified physician. 

Total mRNA from a blood sample taken from each patient was isolated using 
TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample were 
generated as described above. Each probe was denatured and hybridized to a ChondroChip™ 
as described herein. Identification of genes differentially expressed in Whole blood samples 
30 from patients with mild or severe OA as compared to healthy patients was determined by 
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statistical analysis using the Weltch ANOVA test (Michelson and Schofield, 1996). 
(Dendogram analysis not shown). 



In order to identify genes differentially expressed in blood unique to OA but not 
differentially expressed as a result of possible co-morbidities including hypertension, obesity, 

5 asthma, taking systemic steroids, or allergies, genes identified as differentially expressed in 
both OA and any of the genes identified as differentially expressed as a result of co- 
morbidity, e.g., Table 1A (co-morbidity of OA and hypertension v. normal), Table IB (co- 
morbidity of OA and obesity v. normal), Table 3C (co-morbidity of OA and allergy v. 
normal), Table 3D (co-morbidity of OA and taking systemic steroids v. normal), and genes in 

10 common with people identified as having asthma and OA (Table 3 AA) were removed. 
Similarly any genes and unique to obesity (Table 3R), hypertension (Table 3P), allergies 
(Table 3T), systemic steroids (Table 3 V) were also removed. As a result of these 
comparisons, a list of genes unique to individuals with OA were identified. The identity of 
the differentially expressed genes is shown in Table 3AB. 

15 It would be clear to a person skilled in the art that rather than simply remove those 

genes which are relevant to other disease states, one could use a more refined analysis and 
remove those genes which show the same trend in gene expression, e.g. remove those genes 
which show up regulation in a co-morbid state and also show up-regulation in the single 
disease state, but retain those genes which show a different trend in gene expression e.g. 

20 retain those genes which show up regulation in a co-morbid state as compared to down 
regulation in a single disease state. 

Classification or class prediction of a test sample of an individual to determine 
whether said individual has OA or does not have OA can be done using the differentially 
expressed genes as shown in Table 3 AB, irrespective of whether the individual presents with 
25 co-morbidity using well known statistical algorithms as would be understood by a person 
skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 

Brain Cancer 

Analysis of RNA expression profiles of Whole blood samples from individuals 
30 having brain cancer as compared with RNA expression profiles from normal individuals. 
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This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from patients diagnosed with brain cancer as 
compared to Whole blood samples taken from healthy patients. 

As used herein "brain cancer" refers to all forms of primary brain tumors, both 
5 intracranial and extracranial and includes one or more of the following: Glioblastoma, 

Ependymoma, Gliomas, Astrocytoma, Medulloblastoma, Neuroglioma, Oligodendroglioma, 
Meningioma, Retinoblastoma, and Craniopharyngioma. 

Whole blood samples are taken from patients diagnosed with brain cancer as defined 
herein. RNA expression profiles are then analysed and compared to profiles from patients 
10 unaffected by any disease. Preferably healthy patients are chosen who are age and sex 

matched to said patients diagnosed with disease. In each case, the diagnosis of brain cancer 
is corroborated by a skilled Board certified physician. 

Total mRNA from a blood sample is taken from each patient and isolated using 
TRIzol® reagent (GIB CO) and fluorescently labeled probes for each blood sample are 
generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or ChondroChip™ as described herein. Identification of genes 
differentially expressed in Whole blood samples from patients with brain cancer as compared 
to healthy patients is determined by statistical analysis using the Wilcox Mann Whitney rank 
sum test (Glantz SA. Primer of Biostatistics, 5th ed., New York, USA: McGraw-Hill Medical 
Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine 
whether said individuals has brain cancer or does not having brain cancer can be done using 
the differentially expressed genes identified as described above as the predictor genes in 
combination with well known statistical algorithms as would be understood by a person 
25 skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 

Prostate Cancer 

Analysis of RNA expression profiles of Whole blood samples from individuals 
having prostate cancer as compared with RNA expression profiles from normal individuals. 
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This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from patients diagnosed with prostate cancer as 
compared to Whole blood samples taken from healthy patients 

As used herein "prostate cancer" refers to a malignant cancer originating within the 
prostate gland. Patients identified as having prostate cancer can have any stage of prostate 
cancer, as determined clinically (by digital rectal exam or PSA testing) and or pathologically. 
Staging of prostate cancer can done in accordance with TNM or the Staging System of the 
American Joint Committee on Cancer (AJCC). In addition to the TNM system, other systems 
may be used to stage prostate cancer, for example, the Whitmore-Jewett system. 

Whole blood samples are taken from patients diagnosed with prostate cancer as 
defined herein. RNA expression profiles are then analysed and compared to profiles from 
patients unaffected by any disease to identify genes which differentiate as between the two 
groups. Similarly RNA expression profiles can be analysed so as to differentiate as between 
the severity of the prostate cancer. Preferably healthy patients are chosen who are age and 
sex matched to said patients diagnosed with disease or with a specific stage of said disease. 
In each case, the diagnosis of prostate cancer is corroborated by a skilled Board certified 
physician. 

Total mRNA from a blood sample is taken from each patient and isolated using 
TRIzol® reagent (GIB CO) and fluorescently labeled probes for each blood sample is 
generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or a ChondroChip™ as described herein. Identification of genes 
differentially expressed in Whole blood samples from patients with prostate cancer as 
compared to healthy patients is determined by statistical analysis using the Wilcox Mann 
Whitney rank sum test (Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: 
McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine 
whether said individuals has prostate cancer, has a specific stage of prostate cancer, or does 
not having prostate cancer can be done using the differentially expressed genes identified as 
described above as the predictor genes in combination with well known statistical algorithms 
as would be understood by a person skilled in the art and described herein. Commercially 
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available programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class 
Predication are also available. 

Ovarian Cancer 

Analysis of RNA expression profiles of Whole blood samples from individuals 
5 having ovarian cancer as compared with RNA expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from patients diagnosed with ovarian cancer as 
compared to Whole blood samples taken from healthy patients. 

As used herein "ovarian cancer" refers to a malignant cancerous growth originating 
10 within the ovaries. Patients identified as having ovarian cancer can have any stage of ovarian 
cancer. Staging is done by combining information from imaging tests with the results of a 
surgical examination done during a laprotomy. Numbered stages I to IV are used to describe 
the extent of the cancer and whether it has spread (metastasized) to more distant organs. 

Whole blood samples are taken from patients diagnosed with ovarian cancer, or with a 
specific stage of ovarian cancer as defined herein. RNA expression profiles are then analysed 
and compared to profiles from patients unaffected by any disease. Preferably healthy patients 
are chosen who are age and sex matched to said patients diagnosed with disease or with a 
specific stage of said disease. In each case, the diagnosis of ovarian cancer is corroborated by 
a skilled Board certified physician. 

Total mRNA from a blood sample is taken from each patient and isolated using 
TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample is 
generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or a ChondroChip™ as described herein. Identification of genes 
differentially expressed in Whole blood samples from patients with ovarian cancer and or a 
specific stage of ovarian cancer as compared to healthy patients is determined by statistical 
analysis using the Wilcox Mann Whitney rank sum test (Glantz SA. Primer of Biostatistics. 
5th ed. New York, USA: McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine whether said 
individuals has ovarian cancer, has a specific stage of ovarian cancer or does not having 
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ovarian cancer can be done using the differentially expressed genes identified as described 
above as the predictor genes in combination with well known statistical algorithms as 
would be understood by a person skilled in the art and described herein. Commercially 
available programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for 
5 Class Predication are also available. 

Gastric Cancer 

Analysis of RNA expression profiles of Whole blood samples from individuals 
having gastric cancer as compared with RNA expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
10 expression in Whole blood samples taken from patients diagnosed with gastric cancer as 
compared to Whole blood samples taken from healthy patients. 

As used herein "gastric or stomach cancer" refers to a cancerous growth originating 
within the stomach and includes gastric adenocarcinoma, primary gastric lymphoma and 
gastric nonlymphoid sarcoma. Patients identified as having stomac can also be categorized 
15 by stage of said cancer as determined by the System of the American Joint Committee on 
Cancer (AJCC). 

Whole blood samples are taken from patients diagnosed with stomach cancer, or with 
a specific stage of stomach cancer as defined herein. RNA expression profiles are then 

analysed and compared to profiles from patients unaffected by any disease. Preferably 

i 

20 healthy patients are chosen who are age and sex matched to said patients diagnosed with 

disease or with a specific stage of said disease. In each case, the diagnosis of stomach cancer 
is corroborated by a skilled Board certified physician. 

Total nxRNA from a blood sample is taken from each patient and isolated using 
TRIzol® reagent (GD3CO) and fluorescently labeled probes for each blood sample is 

25 generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or a ChondroChip™ as described herein. Identification of genes 
differentially expressed in Whole blood samples from patients with stomach cancer and or a 
specific stage of stomach cancer as compared to healthy patients is determined by statistical 
analysis using the Wilcox Mann Whitney rank sum test (Glantz SA, Primer of Biostatistics, 

30 5th ed., New York, USA: McGraw-Hill Medical Publishing Division, 2002). 
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Classification or class prediction of a test sample of an individual to determine 
whether said individuals has stomach cancer, has a specific stage of stomach cancer or does 
not having stomach cancer can be done using the differentially expressed genes identified 
as described above as the predictor genes in combination with well known statistical 
5 algorithms as would be understood by a person skilled in the art and described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Predication are also available. 

Breast Cancer 

Analysis of RNA expression profiles of Whole blood samples from individuals 
10 having breast cancer as compared with RNA expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from patients diagnosed with breast cancer as 
compared to Whole blood samples taken from healthy patients. 

As used herein "breast cancer" refers to a cancerous growth originating within the 
15 breast and includes invasive and non invasive breast cancer such as ductal carcinoma in situ 
(DCIS), lobular carcinoma in situ (LCIS), infiltrating ductal carcinoma, and infiltrating 
lobular carcinoma. Patients identified as having breast cancer can also be categorized by 
stage of said cancer as determined by the System of the American Joint Committee on Cancer 
(AJCC) or TNM classification. 

20 Whole blood samples are taken from patients diagnosed with breast cancer, or with a 

specific stage of breast cancer as defined herein. RNA expression profiles are then analysed 
and compared to profiles from patients unaffected by any disease. Preferably healthy patients 
are chosen who are age and sex matched to said patients diagnosed with disease or with a 
specific stage of said disease. In each case, the diagnosis of breast cancer is corroborated by 

25 a skilled Board certified physician. 

Total mRNA from a blood sample is taken from each patient and isolated using 
TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample is 
generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or a ChondroChip™ as described herein. Identification of genes 
30 differentially expressed in Whole blood samples from patients with breast cancer and or a 
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specific stage of breast cancer as compared to healthy patients is determined by statistical 
analysis using the Wilcox Mann Whitney rank sum test (Glantz SA, Primer of Biostatistics, 
5th ed., New York, USA: McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine 
5 whether said individuals has breast cancer, has a specific stage of breast cancer or does not 
have breast cancer can be done using the differentially expressed genes identified as 
described above as the predictor genes in combination with well known statistical 
algorithms as would be understood by a person skilled in the art and described herein. 
Commercially available programs such as those provided by Silicon Genetics (e.g. 
10 GeneSpring™) for Class Predication are also available. 

Nasopharyngeal Cancer 

Analysis of RNA expression profiles of Whole blood samples from individuals 
having nasopharyngeal cancer as compared with RNA expression profiles from normal 
individuals. 

15 This example demonstrates the use of the claimed invention to detect differential gene 

expression in Whole blood samples taken from patients diagnosed with nasopharyngeal 
cancer as compared to Whole blood samples taken from healthy patients. 

As used herein "nasopharyngeal cancer" refers to a cancerous growth arising from the 
epithelial cells that cover the surface and line the nasopharynx. Patients identified as having 
20 nasopharyngeal cancer can also be categorized by stage of said cancer as determined by the 
System of the American Joint Committee on Cancer (AJCC) or TNM classification. 

Whole blood samples are taken from patients diagnosed with nasopharyngeal cancer, 
or with a specific stage of nasopharyngeal cancer as defined herein. RNA expression profiles 
are then analysed and compared to profiles from patients unaffected by any disease. 
25 Preferably healthy patients are chosen who are age and sex matched to said patients 

diagnosed with disease or with a specific stage of said disease. In each case, the diagnosis of 
nasopharyngeal cancer is corroborated by a skilled Board certified physician. 

Total mRNA from a blood sample is taken from each patient and isolated using 
TRIzol® reagent (GIB CO) and fluorescently labeled probes for each blood sample is 
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generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or a ChondroChip™ as described herein. Identification of genes 
differentially expressed in Whole blood samples from patients with nasopharyngeal cancer 
and or a specific stage of breast cancer as compared to healthy patients is determined by 
5 statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz S A, Primer of 
Biostatistics, 5th ed., New York, USA: McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine 
whether said individuals has nasopharyngeal cancer, has a specific stage of nasopharyngeal 
cancer or does not have nasopharyngeal cancer can be done using the differentially 
10 expressed genes identified as described above as the predictor genes in combination with 
well known statistical algorithms as would be understood by a person skilled in the art and 
described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Predication are also available. 

Guillain Barre syndrome 

15 Analysis of RNA expression profiles of Whole blood samples from individuals 

having Guillain Barre syndrome as compared with RNA expression profiles from normal 
individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from patients diagnosed with Guillain Barre 
20 syndrome as compared to Whole blood samples taken from healthy patients. 

As used herein "Guillain Barre syndrome" refers to an acute, usually rapidly 
progressive form of inflammatory polyneuropathy characterized by muscular weakness and 
mild distal sensory loss. 

Whole blood samples are taken from patients diagnosed with Guillain Barre 
25 syndrome as defined herein. RNA expression profiles are then analysed and compared to 
profiles from patients unaffected by any disease. Preferably healthy patients are chosen who 
are age and sex matched to said patients diagnosed with disease. In each case, the diagnosis 
of Guillain Barre syndrome is corroborated by a skilled Board certified physician. 
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Total mRNA from a blood sample is taken from each patient and isolated using 
TRlzol® reagent (GIB CO) and fluorescently labeled probes for each blood sample is 
generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or a ChondroChip™ as described herein. Identification of genes 
5 differentially expressed in Whole blood samples from patients with Guillain Barre syndrome 
as compared to healthy patients is determined by statistical analysis using the Wilcox Mann 
Whitney rank sum test (Glantz SA, Primer of Biostatistics, 5th ed., New York, USA: 
McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine 
0 whether said individuals has Guillain Barre syndrome, or does not have Guillain Barre 
syndrome can be done using the differentially expressed genes identified as described 
above as the predictor genes in combination with well known statistical algorithms as 
would be understood by a person skilled in the art and described herein. Commercially 
available programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for 
15 Class Predication are also available. 

Fibromyalgia 

Analysis of RNA expression profiles of Whole blood samples from individuals 
having Fibromyalgia as compared with RNA expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
20 expression in Whole blood samples taken from patients diagnosed with Fibromyalgia as 
compared to Whole blood samples taken from healthy patients. 

As used herein "Fibromyalgia" refers to widespread chronic musculoskeletal pain and 
fatigue. The pain comes from the connective tissues, such as the muscles, tendons, and 
ligaments and does not involve the joints. Whole blood samples are taken from patients 
25 diagnosed with Fibromyalgia as defined herein. RNA expression profiles are then analysed 
and compared to profiles from patients unaffected by any disease. Preferably healthy patients 
are chosen who are age and sex matched to said patients diagnosed with disease. In each 
case, the diagnosis of Fibromyalgia is corroborated by a skilled Board certified physician. 

Total mRNA from a blood sample is taken from each patient and isolated using 
30 TRlzol® reagent (GEBCO) and fluorescently labeled probes for each blood sample is 
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generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or a ChondroChip™ as described herein. Identification of genes 
differentially expressed in Whole blood samples from patients with Fibromyalgia as 
compared to healthy patients is determined by statistical analysis using the Wilcox Mann 
5 Whitney rank sum test (Glantz SA. Primer of Biostatistics. 5th ed. New York, USA: 
McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine 
whether said individuals has Fibromyalgia, or does not have Fibromyalgia can be done 
using the differentially expressed genes identified as described above as the predictor genes 
10 in combination with well known statistical algorithms as would be understood by a person 
skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 

Multiple Sclerosis 

Analysis of RNA expression profiles of Whole blood samples from individuals 
15 having Multiple Sclerosis as compared with RNA expression profiles from normal 
individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from patients diagnosed with Multiple Sclerosis as 
compared to Whole blood samples taken from healthy patients. 

20 As used herein "Multiple Sclerosis" refers to chronic progressive nervous disorder 

involving the loss of myelin sheath surrounding certain nerve fibres. Whole blood samples 
are taken from patients diagnosed with Multiple Sclerosis as defined herein. RNA expression 
profiles are then analysed and compared to profiles from patients unaffected by any disease. 
Preferably healthy patients are chosen who are age and sex matched to said patients 

25 diagnosed with disease. In each case, the diagnosis of Multiple Sclerosis is corroborated by a 
skilled Board certified physician. 

Total mRNA from a blood sample is taken from each patient and isolated using 
TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample is 
generated as described above. Each probe is denatured and hybridized to a Affymetrix 
30 U133A Chip and/or a ChondroChip™ as described herein. Identification of genes 
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differentially expressed in Whole blood samples from patients with Multiple Sclerosis as 
compared to healthy patients is determined by statistical analysis using the Wilcox Mann 
Whitney rank sum test (Glantz SA, Primer of Biostatistics, 5th ed., New York, USA: 
McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine 
whether said individuals has Multiple Sclerosis, or does not have Multiple Sclerosis can be 
done using the differentially expressed genes identified as described above as the predictor 
genes in combination with well known statistical algorithms as would be understood by a 
person skilled in the art and described herein. Commercially available programs such as 
those provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also 
available. 

Muscular Dystrophy 

Analysis of RNA expression profiles of Whole blood samples from individuals 
having Muscular Dystrophy as compared with RNA expression profiles from normal 
individuals. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from patients diagnosed with Muscular Dystrophy 
as compared to Whole blood samples taken from healthy patients. 

As used herein "Muscular Dystrophy" refers to a hereditary disease of the muscular 
system characterized by weakness and wasting of the skeletal muscles. Muscular Dystrophy 
includes Duchennes' Muscular Dystrophy, limb-girdle muscular dystrophy, myotonia 
atrophica, myotonic muscular dystrophy, pseudohypertrophic muscular dystrophy, and 
Steinhardt's disease. 

Whole blood samples are taken from patients diagnosed with Muscular Dystrophy as 
defined herein. RNA expression profiles are then analysed and compared to profiles from 
patients unaffected by any disease. Preferably healthy patients are chosen who are age and 
sex matched to said patients diagnosed with disease. In each case, the diagnosis of Muscular 
Dystrophy is corroborated by a skilled Board certified physician. 
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Total mRNA from a blood sample is taken from each patient and isolated using 
TRIzol® reagent (GIB CO) and fluorescently labeled probes for each blood sample is 
generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or a ChondroChip™ as described herein. Identification of genes 
5 differentially expressed in Whole blood samples from patients with Muscular Dystrophy as 
compared to healthy patients is determined by statistical analysis using the Wilcox Mann 
Whitney rank sum test (Glantz SA,. Primer of Biostatistics, 5th ed., New York, USA: 
McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine 
L0 whether said individuals has Muscular Dystrophy, or does not have Muscular Dystrophy 
can be done using the differentially expressed genes identified as described above as the 
predictor genes in combination with well known statistical algorithms as would be 
understood by a person skilled in the art and described herein. Commercially available 
programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class 
15 Predication are also available. 

Septic Joint Arthroplasty 

Analysis of RNA expression profiles of Whole blood samples from individuals 
having septic joint arthroplasty as compared with RNA expression profiles from normal 
individuals. 

20 This example demonstrates the use of the claimed invention to detect differential gene 

expression in Whole blood samples taken from patients diagnosed with septic joint 
arthroplasty as compared to Whole blood samples taken from healthy patients. 

As used herein "septic joint arthroplasty" refers to an inflammation of the joint caused 
by a bacterial infection. 

25 Whole blood samples are taken from patients diagnosed with septic joint arthroplasty 

as defined herein. RNA expression profiles are then analysed and compared to profiles from 
patients unaffected by any disease. Preferably healthy patients are chosen who are age and 
sex matched to said patients diagnosed with disease. In each case, the diagnosis of septic 
joint arthroplasty is corroborated by a skilled Board certified physician. 
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Total mRNA from a blood sample is taken from each patient and isolated using 
TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample is 
generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or a ChondroChip™ as described herein. I dentification of genes 
5 differentially expressed in Whole blood samples from patients with septic joint arthroplasty 
as compared to healthy patients is determined by statistical analysis using the Wilcox Mann 
Whitney rank sum test (Glantz SA, Primer of Biostatistics, 5th ed. ? New York, USA: 
McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine 
10 whether said individuals has septic joint arthroplasty, or does not have septic joint 

arthroplasty can be done using the differentially expressed genes identified as described 
above as the predictor genes in combination with well known statistical algorithms as would 
be understood by a person skilled in the art and described herein. Commercially available 
programs such as those provided by Silicon Genetics (e.g. GeneSpring™) for Class 
15 Predication are also available. 

Hepatitis 

Analysis of RNA expression profiles of Whole blood samples from individuals 
having hepatitis as compared with RNA expression profiles from normal individuals. 

This example demonstrates the use of the claimed invention to detect gene expression 
in Whole blood samples taken from patients diagnosed with hepatitis as compared to Whole 
blood samples taken from healthy patients. As used herein "hepatitis" refers to an 
inflammation of the liver caused by a virus or toxin and can include hepatitis A, hepatitis B, 
hepatitis C, hepatitis D, hepatitis E, and hepatitis F. Whole blood samples are taken from 
patients diagnosed with hepatitis as defined herein. RNA expression profiles are then 
analysed and compared to profiles from patients unaffected by any disease. Preferably 
healthy patients are chosen who are age and sex matched to said patients diagnosed with 
disease. In each case, the diagnosis of hepatitis is corroborated by a skilled Board certified 
physician.Total mRNA from a blood sample is taken from each patient and isolated using 
TRIzol® reagent (GIBCO) and fluorescently labeled probes for each blood sample is 
generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or a ChondroChip™ as described herein. Identification of genes 
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differentially expressed in Whole blood samples from patients with hepatitis as compared to 
healthy patients is determined by statistical analysis using the Wilcox Mann Whitney rank 
sum test (Glantz SA ? Primer of Biostatistics, 5th ed., New York, USA: McGraw-Hill Medical 
Publishing Division, 2002). 

5 Classification or class prediction of a test sample of an individual to determine 

whether said individuals has hepatitis, or does not have hepatitis can be done using the 
differentially expressed genes identified as described above as the predictor genes in 
combination with well known statistical algorithms as would be understood by a person 
skilled in the art and described herein. Commercially available programs such as those 

0 provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 

Malignant Hyperthermia Susceptibility 

Analysis of RNA expression profiles of Whole blood samples from individuals 
having Malignant Hyperthermia Susceptibility as compared with RNA expression profiles 
from normal individuals. 

15 This example demonstrates the use of the claimed invention to detect differential gene 

expression in Whole blood samples taken from patients diagnosed with Malignant 
Hyperthermia Susceptibility as compared to Whole blood samples taken from healthy 
patients. As used herein "Malignant Hyperthermia Susceptibility" refers to a 
pharmacogenetic disorder of skeletal muscle calcium regulation often developing during or 

20 after a general anaesthesia. 

Whole blood samples are taken from patients diagnosed with Malignant Hyperthermia 
Susceptibility as defined herein. RNA expression profiles are then analysed and compared to 
profiles from patients unaffected by any disease. Preferably healthy patients are chosen who 
are age and sex matched to said patients diagnosed with disease. In each case, the diagnosis 

25 of Malignant Hyperthermia Susceptibility is corroborated by a skilled Board certified 

physician. Total mRNA from a blood sample is taken from each patient and isolated using 
TRIzol® reagent (GEBCO) and fluorescently labeled probes for each blood sample is 
generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or a ChondroChip™ as described herein. Identification of genes 

30 differentially expressed in Whole blood samples from patients with Malignant Hyperthermia 
Susceptibility as compared to healthy patients is determined by statistical analysis using the 
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Wilcox Mann Whitney rank sum test (Glantz SA, Primer of Biostatistics, 5th ed., New York, 
USA: McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of an individual to determine 
whether said individuals has Malignant Hyperthermia Susceptibility, or does not have 
5 Malignant Hyperthermia Susceptibility can be done using the differentially expressed genes 
identified as described above as the predictor genes in combination with well known 
statistical algorithms as would be understood by a person skilled in the art and described 
herein. Commercially available programs such as those provided by Silicon Genetics (e.g. 
GeneSpring™) for Class Predication are also available. 

0 Osteoarthritis Horses 

Analysis of RNA expression profiles of Whole blood samples from horses having 
osteoarthritis as compared with RNA expression profiles from normal or non-osteoarthritic 
horses. 

This example demonstrates the use of the claimed invention to detect differential gene 
15 expression in Whole blood samples taken from horses so as to diagnose equine arthritis as 
compared to Whole blood samples taken from healthy horses. 

As used herein "arthritis" in reference to horses refers to a degenerative joint disease 
that affects horses by causing lameness. Although it can appear in any joint, most common 
areas are the upper knee joint, front fetlocks, hocks, or coffin joints in the front feet. The 

20 condition can be caused by trauma, mineral or dietary deficiency, old age, poor conformation, 
over exertion or infection. The different structures that can be damaged in arthritis are the 
cartilage inside joints, the bone in the joints, the joint capsule, the synovial membranes, the 
ligaments around the joints and lastly the fluid that lubricates the insides of 6 synovial joints'. 
In severe cases all of these structures are affected, hi for example osteochondrosis only the 

25 cartilage may be affected. 

Regardless of the cause, the disease begins when the synovial fluid that lubricates 
healthy joints begins to thin. The decrease in lubrication causes the cartilage cushion to break 
down, and eventually the bones begin to grind painfully against each other. Diagnostic tests 
used to confirm arthritis include X-rays, joint fluid analysis, and ultrasound. 
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Whole blood samples are taken from horses diagnosed with arthritis as defined herein. 
RNA expression profiles are then analysed and compared to profiles from horses unaffected 
by any disease. Preferably healthy horses are chosen who are age and sex matched to said 
horses diagnosed with disease. In each case, the diagnosis of arthritis is corroborated by a 
5 certified veterinarian. 

Total mRNA from a blood sample is taken from each horse and isolated using 
TRIzol® reagent (GDBCO) and fluorescently labeled probes for each blood sample is 
generated as described above. Each probe is denatured and hybridized to a Affymetrix 
U133A Chip and/or a ChondroChip™ as described herein. An equine specific microarray 
10 representing the equine genome can also be used. Identification of genes differentially 

expressed in Whole blood samples from horses with arthritis as compared to healthy horses is 
determined by statistical analysis using the Wilcox Mann Whitney rank' sum test (Glantz SA, 
Primer of Biostatistics, 5th ed., New York, USA: McGraw-Hill Medical Publishing Division, 
2002). 

15 Classification or class prediction of a test sample of a horse to determine whether 

said horse has arthritis or does not have arthritis can be done using the differentially 
expressed genes identified as described above as the predictor genes in combination with 
well known statistical algorithms as would be understood by a person skilled in the art and 
described herein. Commercially available programs such as those provided by Silicon 

20 Genetics (e.g. GeneSpring™) for Class Predication are also available. 

Osteoarthritic Dogs 

Analysis of RNA expression profiles of Whole blood samples from dogs having 
osteoarthritis as compared with RNA expression profiles from normal or non-osteoarthritic 
dogs. 

25 This example demonstrates the use of the claimed invention to detect differential gene 

expression in Whole blood samples taken from dogs so as to diagnose equine arthritis as 
compared to Whole blood samples taken from healthy horses. 

As used herein "osteoarthritis" in reference to dogs is a form of degenerative joint 
disease which involves the deterioration of and changes to the cartilage and bone. In 
30 response to inflammation in and about the joint, the body responds with bony remodeling 
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around the joint structure. This process can be slow and gradual with minimal outward 
symptoms, or more rapidly progressive with significant pain and discomfort. Osteoarthritic 
changes can occur in response to infection and injury of the joint as well. 

Whole blood samples are taken from dogs diagnosed with osteoarthritis as defined 
5 herein. RNA expression profiles are then analysed and compared to profiles from dogs 
unaffected by any disease. Preferably healthy dogs are chosen who are age, sex and breed 
matched to said dogs diagnosed with disease. In each case, the diagnosis of osteoarthritis is 
corroborated by a certified veterinarian. 

Total mRNA from a blood sample is taken from each dog and isolated using TRIzol® 
0 reagent (GIB CO) and fluorescently labeled probes for each blood sample is generated as 
described above. Each probe is denatured and hybridized to a Affymetrix U133A Chip 
and/or a ChondroChip™ as described herein. A canine specific microarray representing the 
canine genome can also be used. Identification of genes differentially expressed in Whole 
blood samples from dogs with osteoarthritis as compared to healthy horses is determined by 
15 statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz SA, Primer of 
Biostatistics, 5th ed., New York, USA: McGraw-Hill Medical Publishing Division, 2002). 

Classification or class prediction of a test sample of a dog to determine whether said 
dog has osteoarthritis or does not have osteoarthritis can be done using the differentially 
expressed genes identified as described above as the predictor genes in combination with 
20 well known statistical algorithms as would be understood by a person skilled in the art and 
described herein. Commercially available programs such as those provided by Silicon 
Genetics (e.g. GeneSpring™) for Class Predication are also available. 

Manic Depression Syndrome (MDS) as compared with Schizophrenia RNA expression 
profiles 

25 Analysis of RNA expression profiles of Whole blood samples from individuals 

having Manic Depression Syndrome (MDS) as compared with RNA expression profiles from 
individuals having Schizophrenia. 

This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from patients diagnosed with MDS as compared to 
30 Whole blood samples taken from schizophrenic patients. 
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As used herein "Manic Depression Syndrome (MDS)" refers to a mood disorder 
characterized by alternating mania and depression. As used herein, "schizophrenia" is 
defined as a psychotic disorders characterized by distortions of reality and disturbances of 
thought and language and withdrawal from social contact. Patients diagnosed with 

5 "schizophrenia" can include patients having any of the following diagnosis: an acute 

schizophrenic episode, borderline schizophrenia, catatonia, catatonic schizophrenia, catatonic 
type schizophrenia, disorganized schizophrenia, disorganized type schizophrenia, 
hebephrenia, hebephrenic schizophrenia, latent schizophrenia, paranoic type schizophrenia, 
paranoid schizophrenia, paraphrenia, paraphrenic schizophrenia, psychosis, reactive 

0 schizophrenia or the like. 

Whole blood samples are taken from patients diagnosed with MDS or Schizophrenia 
as defined herein. RNA expression profiles are then analyzed and compared to profiles from 
patients unaffected by any disease. Preferably healthy patients are chosen who are age and 
sex matched to said patients diagnosed with disease. In each case, the diagnosis of MDS and 
L5 Schizophrenia is corroborated by a skilled Boardcertified physician. Total mRNA from a 
blood sample is taken from each patient and isolated using TRIzol* reagent (GIBCO) and 
fluorescently labeled probes for each blood sample is generated as described above. 

Each probe is denatured and hybridized to a Affymetrix U133A Chip and/or a 
ChondroChip(tm) as described herein. Identification of genes differentially expressed in 

20 Whole blood samples from patients with MDS as compared to Schizophrenic patients as 
compared to normal individuals is determined by statistical analysis using the Wilcox Mann 
Whitney rank sum test (Glantz SA, Primer of Biostatistics, 5th ed., New York, USA: 
McGraw-Hill Medical Publishing Division, 2002) (data not shown). 294 genes were 
identified as being differentially expressed with a p value of < 0.05 as between the 

25 schizophrenic patients, the MDS patients and those control individuals. The identity of the 
differentially expressed genes is shown in Table SAC. 

Classification or class prediction of a test sample of an individual to determine 
whether said individuals has MDS, has Schizophrenia or is normal can be done using the 
differentially expressed genes identified as described above as the predictor genes in 
30 combination with well known statistical algorithms as would be understood by a person 
skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring (tm) ) for Class Predication are also available. 
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Prediction of Progression of Osteoarthritis. 

Analysis of RNA expression profiles of Whole blood samples of Individuals So as to 
5 Predict Progression of Osteoarthritis. 

This example demonstrates the use of the claimed invention to predict the progression 
of Osteoarthritis. 

As used herein "osteoarthritis" is a form of degenerative joint disease which involves 
the deterioration of and changes to the cartilage and bone. In response to inflammation in and 
10 about the joint, the body responds with bony remodeling around the joint structure. This 
process can be slow and gradual with minimal outward symptoms, or more rapidly 
progressive with significant pain and discomfort. Osteo arthritic changes can occur in 
response to infection and injury of the joint as well. 

Whole blood samples are taken from test individuals not having any symptoms of 
15 osteoarthritis and RNA expression profiles are then analyzed and compared to profiles from 
individuals having mild osteoarthritis. 

Classification or class prediction of a test sample of said individual to determine 
whether said individual has mild osteoarthritis or does not have osteoarthritis can be done 
using the differentially expressed genes identified as described herein as the predictor genes 
20 in combination with well known statistical algorithms as would be understood by a person 
skilled in the art and described herein. Commercially available programs such as those 
provided by Silicon Genetics (e.g. GeneSpring™) for Class Predication are also available. 

Individuals identified with mild osteoarthritis as a result of this classification have a 
25 significantly greater chance of developing moderate, marked and/or severe osteoarthris than 
those individuals not diagnosed with mild osteoarthritis. 

Therapy 

Microarray Data Analysis of RNA expression profiles of Whole blood samples from 
individuals having a condition as compared with RNA expression profiles from individuals 
30 not having said condition, and wherein said individual is undergoing therapeutic treatment in 
light of said condition. 
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This example demonstrates the use of the claimed invention to detect differential gene 
expression in Whole blood samples taken from individuals undergoing therapeutic treatment 
of a condition as compared with RNA expression profiles from individuals not undergoing 
treatment. 

5 Whole blood samples are taken from patients who are undergoing therapeutic 

treatment. RNA expression profiles are then analysed and compared to profiles from patients 
not undergoing treatment. 

Total mRNA from a blood sample taken from each patient is isolated using TRIzol® 
reagent (GIBCO) and fluorescently labeled probes for each blood sample are generated as 
10 described above. Each probe is denatured and hybridized to a microarray for example the 

15K Chondrogene Microarray Chip (ChondroChip™), Affymetrix Genechip or Blood chip as 
described herein. 

Identification of genes differentially expressed in Whole blood samples from patients 
undergoing therapeutic treatment as compared to patients not undergoing treatment is 
15 determined by statistical analysis using the Wilcox Mann Whitney rank sum test (Glantz SA. 
Primer of Biostatistics., differentially expressed genes are then identified as being 
differentially expressed with a p value of < 0.05. 

One skilled in the art will appreciate readily that the present invention is well adapted 
to carry out the objects and obtain the ends and advantages mentioned, as well as those 
20 objects, ends and advantages inherent herein. 

The present examples, along with the methods, procedures, treatments, molecules, 
and specific compounds described herein are presently representative of preferred 
embodiments, are exemplary, and are not intended as limitations on the scope of the 
invention. 

25 Changes therein and other uses will occur to those skilled in the art which are 

encompassed within the spirit of the invention as defined by the scope of the claims. 

All patents, patent applications, and published references cited herein are hereby 
incorporated by reference in their entirety. While this invention has been particularly shown 
and described with references to preferred embodiments thereof, it will be understood by 
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those skilled in the art that various changes in form and details may be made therein without 
departing from the scope of the invention encompassed by the appended claims. 
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Wliat is claimed is: 



5 1 . A method of identifying one or more biomarkers for a disease of interest, wherein each of 
said one or more biomarkers corresponds to an RNA transcript, comprising the method 
steps of: 

a) determining the level of one or more RNA transcripts expressed in blood obtained 
from one or more individuals having said disease of interest, wherein each of said one or 

10 more RNA transcripts is a candidate biomarker for said disease of interest; and 

b) comparing the level of each of said one or more RNA transcripts from said step a) 
with the level of each of said one or more RNA transcripts in blood obtained from one or 
more individuals not having said disease of interest, 

wherein the RNA transcripts which display differing levels in the comparison of step 
15 b), are identified as being biomarkers for said disease of interest, 

and/or 

c) determining the level of one or more RNA transcripts expressed in blood obtained 
from one or more individuals having said disease of interest, wherein each of said one or 
more transcripts is a candidate biomarker for said disease of interest; and 
20 d) comparing the level of each of said one or more RNA transcripts from said step c) 

with the level of each of said one or more transcripts in blood obtained from one or more 
individuals having said disease of interest, 

wherein the RNA transcripts which display the same levels in the comparison of step 
d), are identified as being biomarkers for said disease of interest. 

25 

2. The method of claim 1, wherein the disease is selected from the group consisting of: liver 
cancer, bladder cancer, brain cancer, prostate cancer, ovarian cancer, kidney cancer, gastric 
cancer, lung cancer, breast cancer, nasopharyngeal cancer, pancreatic cancer, osteoarthritis, 
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depression, hypertension, heart failure, obesity, rheumatoid arthritis, hyperlipidemia, lung 
disease, chagas disease, allergies, schizophrenia, asthma, manic depression syndrome, 
ankylosing spondylitis, guillain barre syndrome, fibromyalgia, multiple sclerosis, muscular 
dystrophy, septic joint arthroplasty, hepatitis, Crohn's disease or colitis, or malignant 
5 hyperthermia susceptibility, psoriasis, thyroid disorder, irritable bowel syndrome, 
osteoporosis, migraines, eczema, or a heart murmer. 

3 . A method of identifying one or more biomarkers of a stage of progression or regression 
of a disease of interest, wherein each of said one or more biomarkers corresponds to an 
RNA transcript, comprising the steps of: 

10 a) determining the level of one or more RNA transcripts expressed in blood obtained 

from one or more individuals having a stage of said disease of interest, wherein said one 
or more individuals are at the same progressive or regressive stage of said disease of 
interest, and wherein each of said one or more transcripts is a candidate biomarker for 
determining the stage of progression or regression of said disease of interest, and; 

1 5 b) comparing the level of each of said one or more RNA transcripts from said step a) 

with the level of each of said one or more transcripts in blood obtained from one or more 
individuals who are at a progressive or regressive stage of said disease of interest distinct 
from that of said one or more individuals of step a), 

wherein those compared transcripts which display differing levels in the comparison 

20 of step b) are identified as being biomarkers for the stage of progression or regression of 
said disease of interest, 

and/or 

c) determining the level of one or more RNA transcripts expressed in blood obtained 
from one or more individuals having a stage of said disease of interest, wherein said one or 

25 more individuals are at the same progressive or regressive stage of said disease of interest, 
and wherein each of said one or more transcripts is a candidate biomarker for determining the 
stage of progression or regression of said disease of interest, and; 

d) comparing the level of each of said one or more RNA transcripts from said step c) 
with the level of each of said one or more RNA transcripts in blood obtained from one or 

30 more individuals who are at a progressive or regressive stage of said disease of interest as 
that of said one or more individuals of step c), 

wherein those compared transcripts which display the same levels in the comparison 
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of step d) are identified as being biomarkers for the stage of progression or regression of said 
disease of interest. 



4. The method of claim 3, wherein said disease of interest is selected from the group 
5 consisting of: liver cancer, bladder cancer, brain cancer, prostate cancer, ovarian cancer, 
kidney cancer, gastric cancer, lung cancer, breast cancer, nasopharyngeal cancer, pancreatic 
cancer, osteoarthritis, depression, hypertension, heart failure, obesity, rheumatoid arthritis, 
hyperlipidemia, lung disease, chagas disease, allergies, schizophrenia and asthma, manic 
depression syndrome, ankylosing spondylitis, guillain barre syndrome, fibromyalgia, multiple 
10 sclerosis, muscular dystrophy, septic joint arthroplasty, hepatitis, Crohn's disease or colitis, 
malignant hyperthermia susceptibility, psoriasis, thyroid disorder, irritable bowel syndrome, 
osteoporosis, migraines, eczema, or a heart murmer, alzheimer's, CAD, Diabetes, or 
colorectal cancer. 

15 5 . A method of identifying one or more biomarkers for a condition of interest, wherein 
each of said one or more biomarkers corresponds to an RNA transcript, comprising the steps 
of: 

a) determining the level of one or more RNA transcripts expressed in blood obtained 
from one or more individuals having said condition of interest, wherein each of said one or 

20 more transcripts is a candidate biomarker for said condition of interest; and 

b) comparing the level of each of said one or more RNA transcripts from said step a) 
with the level of each of said one or more RNA transcripts in blood obtained from one or 
more individuals not having said condition of interest, 

wherein those compared transcripts which display differing levels in the comparison 
25 of step b) are identified as being biomarkers for said condition of interest, 

and/or 

c) determining the level of one or more RNA transcripts expressed in blood obtained 
from one or more individuals having said condition of interest, wherein each of said one or 
more transcripts is a candidate biomarker for said condition of interest; and 

30 d) comparing the level of each of said one or more RNA transcripts from said step c) 
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with the level of each of said one or more RNA transcripts in blood obtained from one or 
more individuals having said condition of interest, 

wherein those compared transcripts which display the same levels in the comparison 
of step d) are identified as being biomarkers for said condition of interest. 

5 

6. The method of claim 5, wherein said condition of interest is the condition resulting 
from the administration of medication wherein said medication is selected from the group 
consisting of: Celebrex, Vioxx, NSAIDS, Cortozone, Hyaluronic Acid, Systemic Steroids, 
hormone replacement therapy, pregnazone and birth control pills. 

10 7. The method of claim 5, wherein said condition of interest is the condition resulting 
from exposure to an environmental condition, wherein said environmental condition is 
cigarette smoke. 

8. A method of identifying one or more biomarkers to differentiate between a pair of 
disease and/or conditions, wherein said pair consists of a first and a second disease or 

15 condition of interest, and wherein each of said one or more biomarkers corresponds to an 
RNA transcript, comprising the steps of: 

a) determining the level of one or more RNA transcripts expressed in blood obtained 
from one or more individuals having said first disease or condition of interest while not 
having said second disease or condition of interest, wherein each of said one or more 

20 transcripts is a candidate biomarker for differentiating said first disease or condition of 
interest from said second disease or condition of interest; and 

b) comparing the level of each of said one or more RNA transcripts from said step a) 
with the level of each of said one or more genes transcripts in blood obtained from one or 
more individuals having said second disease or condition of interest while not having said 

25 first disease or condition of interest, 

wherein those compared transcripts which display differing levels in the comparison 
of step b) are identified as being biomarkers for differentiating said first disease or condition 
of interest from said second disease or condition of interest. 

9. The method of claim 8, wherein said first and second disease or condition of interest of 
30 said pair of disease and/or conditions, are selected from the group consisting of rheumatoid 

arthritis, osteoarthritis, schizophrenia, manic depression syndrome, liver cancer, hepatitis, 
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bladder cancer, kidney cancer, bladder cancer, testicular cancer, pancreatic cancer, kidney- 
cancer, liver cancer, stomach cancer, colon cancer, Chagas disease, heart failure, coronary 
artery disease, asymptomatic Chagas Disease, symptomatic Chagas Disease, alzheimer's 
Disease, allergies, systemic steroids, allergy, Type II Diabetes, obesity, hypertension, , 
hyperliidemia, lung disease, bladder cancer, asthma, psoriasis, thyroid disorder, irritable 
bowel syndrome, osteoporosis, migraine headaches, excema, NASH, Crohn's colitis, chronic 
cholecystitis, cervical cancer, cardiovascular disease, and neurological disease. 

10. The method of any one of claims 1-2, wherein said disease of interest is hypertension 
and wherein said one or more RNA transcripts are transcribed from one or more genes 
selected from the group consisting of the genes listed in Tables 1 A, IE, IP and 1Q, or 
wherein said disease of interest is obesity and wherein said one or more RNA transcripts are 
transcribed from one or more genes selected from the group consisting of the genes listed in 
Tables IB, IF, 1R, and IS, or wherein said disease of interest is allergies and wherein said 
one or more RNA transcripts are transcribed from one or more genes selected from the group 
consisting of the genes listed in Tables 1C, IT, and 1U, or wherein said disease of interest is 
type II diabetes, and wherein said one or more RNA transcripts are transcribed from one or 
more genes selected from the group consisting of the genes listed in Table 1G, and wherein 
said marker does not identify the insulin gene, or wherein said disease of interest is 
hyperlipidemia, and wherein said one or more RNA transcripts are transcribed from one or 
more genes selected from the group consisting of the genes listed in Table 1H, or wherein 
said disease of interest is lung disease, and wherein said one or more RNA transcripts are 
transcribed from one or more genes selected from the group consisting of the genes listed in 
Tablell, or wherein said disease of interest is bladder cancer, and wherein said one or more 
RNA transcripts are transcribed from one or more genes selected from the group consisting of 
the genes listed in Tables 1 J and IK, or wherein said disease of interest is coronary artery 
disease, and wherein said one or more RNA transcripts are transcribed from one or more 
genes selected from the group consisting of the genes listed in Table 1L wherein said marker 
does not identify a gene selected from the group consisting of ANF, ZFP and DMyHC, or 
wherein said disease of interest is rheumatoid arthritis, and wherein said one or more RNA 
transcripts are transcribed from one or more genes selected from the group consisting of the 
genes listed in Table 1M, or wherein said disease of interest is osteoarthritis, and wherein 
said one or more RNA transcripts are transcribed from one or more genes selected from the 
group consisting of the genes listed in 1AB, or wherein said disease of interest is depression, 
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and wherein said one or more RNA transcripts are transcribed from one or more genes 
selected from the group consisting of the genes listed in Table IN, or wherein said disease of 
interest is liver cancer, and wherein said one or more RNA transcripts are transcribed from 
one or more genes selected from the group consisting of the genes listed in Table IX, or 
wherein said disease of interest is chagas disease, and wherein said one or more RNA 
transcripts are transcribed from one or more genes selected from the group consisting of the 
genes listed in Table 1Z, or wherein said disease of interest is asthma, and wherein said one 
or more RNA transcripts are transcribed from one or more genes selected from the group 
consisting of the genes listed in Table 1 AA, or wherein said disease of interest is ankylosing 
spondylitis, and wherein said one or more RNA transcripts are transcribed from one or more 
genes selected from the group consisting of the genes listed in Table 1AM, wherein said 
disease of interest is manic depression, and wherein said one or more RNA transcripts are 
transcribed from one or more genes selected from the group consisting of the genes listed in 
Table IAN, or wherein said disease of interest is alzheimers disease, and wherein said one or 
more RNA transcripts are transcribed from one or more genes selected from the group 
consisting of the genes listed in Table 1 AO and wherein said marker does not identify the 
APP gene, or wherein said disease of interest is cervical cancer, and wherein said one or more 
RNA transcripts are transcribed from one or more genes selected from the group consisting of 
the genes listed in Table 1 AQ, or wherein said disease of interest is gastric cancer, and 
wherein said one or more RNA transcripts are transcribed from one or more genes selected 
from the group consisting of the genes listed in Table 1 AR, or wherein said disease of 
interest is kidney cancer, and wherein said one or more RNA transcripts are transcribed from 
one or more genes selected from the group consisting of the genes listed in Table IAS, or 
wherein said disease of interest is testicular cancer, and wherein said one or more RNA 
transcripts are transcribed from one or more genes selected from the group consisting of the 
genes listed in Table 1 AT, or wherein said disease of interest is colon cancer, and wherein 
said one or more RNA transcripts are transcribed from one or more genes selected from the 
group consisting of the genes listed in Table 1AU, or wherein said disease of interest is heart 
failure, and wherein said one or more RNA transcripts are transcribed from one or more 
genes selected from the group consisting of the genes listed in Table 1 AV, or wherein said 
disease of interest is hepatitis, and wherein said one or more RNA transcripts are transcribed 
from one or more genes selected from the group consisting of the genes listed in Table 1AW, 
or wherein said disease of interest is either Crohn's disease or colitis, and wherein said one 
or more RNA transcripts are transcribed from one or more genes selected from the group 
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consisting of the genes listed in Table 1AX, or wherein said disease of interest is 
osteoporosis, and wherein said one or more RNA transcripts are transcribed from one or more 
genes selected from the group consisting of the genes listed in Table 1 AY, or wherein said 
condition of interest is the taking of systemic steroids, and wherein said one or more RNA 
transcripts are transcribed from one or more genes selected from the group consisting of the 
genes listed in Tables ID, IV, 1 W, and IAD. 

1 1 . The method of any one of claims 5, 6 or 7, wherein said condition of interest is the 
administration of systemic steroids and said one or more RNA transcripts are transcribed 
from one or more genes selected from the group consisting of the genes listed in Tables ID, 
IV, 1W and AD, wherein said condition of interest is the administration of Celebrex, and said 
one or more RNA transcripts are transcribed from one or more genes selected from the group 
consisting of the genes listed in Table 7B, wherein said condition of interest is the 
administration of Vioxx, and said one or more RNA transcripts are transcribed from one or 
more genes selected from the group consisting of the genes listed in Table 7C, wherein said 
condition of interest is the administration of NSAIDs, and said one or more RNA transcripts 
are transcribed from one or more genes selected from the group consisting of the genes listed 
in Table 7E, wherein said condition of interest is the administration of Cortisone, and said 
one or more RNA transcripts are transcribed from one or more genes selected from the group 
consisting of the genes listed in Table 7F, wherein said condition of interest is the 
administration of Lipitor, and said one or more RNA transcripts are transcribed from one or 
more genes selected from the group consisting of the genes listed in Table 7H, and wherein 
said condition of interest is one of smoking, and said one or more RNA transcripts are 
transcribed from one or more genes selected from the group consisting of the genes listed in 
Table 71. 

12. A method of identifying one or more biomarkers specific for a group of related 
diseases and/or conditions of interest, wherein said group of related diseases and/or 
conditions comprise those diseases and/or conditions which display a similar phenotype 
and/or which originate from and/or effect the same physiological system, or which have the 
same or similar etiology, or which are medically classified together, and wherein each of said 
one or more biomarkers corresponds to a gene transcript, comprising the steps of: 

a) determining the level of one or more RNA transcripts expressed in blood obtained 
from one or more individuals having a disease and/or condition that is encompassed by said 
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group of related diseases and/or conditions of interest, wherein each of said one or more 
transcripts is transcribed from a gene that is a candidate biomarker for said group of related 
diseases and/or conditions of interest , 

b) comparing the level of each of said one or more RNA transcripts from said step a) 
5 with the level of each of said one or more genes transcripts in blood obtained from one or 
more individuals not having a disease and/or condition that is encompassed by said group of 
related diseases and/or conditions of interest, 

wherein those compared transcripts which display differing levels in the comparison 
of step b) are identified as being biomarkers for identifying one or more biomarkers specific 
10 for said group of related diseases and/or conditions of interest. 

13. The method of claim 12, wherein said group of related diseases and/or conditions of 
interest is selected from the following groups of related diseases and/or conditions: cancer, 
cardiovascular disease and neurological disease. 

14. The method of claim 13 wherein the disease of cancer is consists of: cervical cancer, 
15 stomach cancer, kidney cancer, testicular cancer, bladder cancer, liver cancer, lung cancer 

and colon cancer 

15. The method of claim 13 wherein the cardiovascular disease and/or condition consists 
of coronary artery disease, heart failure and hypertension. 

16. The method of claim 13, wherein said neurological disease and/or condition consists of 
20 alzheimer's disease, Manic Depression and 

Schizophrenia. 

17. A method of diagnosing or prognosing a condition of interest and/or a stage of 
progression or regression thereof, in an individual suspected as having said condition of 

25 interest, comprising the steps of: 

a) determining the level of one or more gene transcripts expressed in blood 
corresponding to the biomarkers identified in any one of claims 1 through 16 and claim 22, 
obtained from said individual, and 

b) comparing the level of each of said one or more gene transcripts in said blood 

30 according to step a) with the level of each of said one or more gene transcripts in blood from 
one or more individuals having said condition of interest, 
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c) comparing the level of each of said one or more gene transcripts in said blood 
according to step a) with the level of each of said one or more gene transcripts in blood from 
one or more individuals not having said condition, 

d) determining whether the level of said one or more gene transcripts of step a) classify 
5 with the levels of said transcripts in step b) as compared with levels of said transcripts in step 
c), wherein said determination is indicative of said individual of step a) having said disease 
of interest. 

18. A kit comprising one or more biomarkers identified in any one of claims 1 through 16 and 
10 claim 22. 

19. A kit for diagnosing or prognosing a condition of interest comprising: a) two gene- 
specific priming means designed to produce double stranded DNA complementary to a 
transcript which correlates to one or more biomarkers identified in any one of claims 1 

15 through 16 and claim 22, wherein said first priming means contains a sequence which can 
hybridize to said RNA transcript to create an extension product and said second priming 
means capable of hybridizing to said extension product; b) an enzyme with reverse 
transcriptase activity c) an enzyme with thermostable DNA polymerase activity and d) a 
labeling means; wherein said primers are used to detect the quantitative expression levels of 

20 said RNA transcript(s) in a test subject. 

20. A kit for monitoring a course of therapeutic treatment of a condition of interest, 
comprising a) two gene-specific priming means designed to produce double stranded DNA 
complementary to a transcript which correlates to one or more biomarkers identified in any 

25 one of claims 1 through 16 and claim 22, wherein said first priming means contains a 

sequence which can hybridize to said RNA transcript to create an extension product and said 
second priming means capable of hybridizing to said extension product; b) an enzyme with 
reverse transcriptase activity c) an enzyme with thermostable DNA polymerase activity and 
d) a labeling means; wherein said primers are used to detect the quantitative expression levels 

30 of said RNA transcript(s) in a test subject. 

21 . A kit for monitoring progression or regressionof coronary artery disease, comprising: a) 
two gene-specific priming means designed to produce double stranded DNA complementary 
to a transcript which correlates to one or more biomarkers identified in any one of claims 1 
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through 16 and claim 22, wherein said first priming means contains a sequence which can 
hybridize to said RNA transcript to create an extension product and said second priming 
means capable of hybridizing to said extension product; b) an enzyme with reverse 
transcriptase activity c) an enzyme with thermostable DNA polymerase activity and d) a 
labeling means; wherein said primers are used to detect the quantitative expression levels of 
said RNA transcript(s) in a test subject. 

22. The method of claim 9, wherein said first and second disease or condition of interest 
of said pair of disease and/or conditions, are selected from the group consisting of 
rheumatoid arthritis and osteoarthritis, respectively, schizophrenia and manic depression 
syndrome, respectively, liver cancer and hepatitis,respectively, bladder cancer and kidney 
cancer, respectively, bladder cancer and testicular cancer, respectively, testicular cancer and 
kidney cancer, respectively, liver cancer and stomach cancer, respectively, liver cancer and 
colon cancer, respectively, stomach cancer and colon cancer, respectively, Chagas disease 
and heart failure, respectively, Chagas disease and coronary artery disease, respectively, 
coronary artery disease and heart failure, respectively, asymptomatic Chagas Disease and 
symptomatic Chagas Disease, respectively, Alzheimer's Disease and Schizophrenia, 
respectivley, and Alzheimer's Disease and Manic Depression, respectively. 
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Figure 2 
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Figure e 
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Figure 7 
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Figure 1 1 
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Figure 16 
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Figure 19 
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Figure 20 
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Figure 21 
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Figure 22 
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Figure 23 
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Figure 24 
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Figure 25 
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Figure 26 




23/25 



WO 2004/112589 



PCT/US2004/020836 




24/25 



WO 2004/112589 



Figure28 



PCT/US2004/020836 




25/25 



